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SEQUENCE LISTING 

<110> Wong et al . 

<12 0> CLONING OF RHADINOVIRUS GENOME AND METHODS FOR ITS USE 
<130> 53683 

<140> 60/107,507 
<141> 1998-11-06 

<140> 60/109,409 
<141> 1998-11-20 

<160> 179 

<170> 

<210> 1 
<211> 133719 
<212> DNA 

<213> Macaca mulatta rhadinovirus 17577 

1 GATCGGGAAA ACGCGAGGGG AGCGGGGGAC AGGGGACGGC GTGTGCGTGC TTGTGAGACA 
61 CCGGGTACGG CTGCCTGCCT GCTCGCTGGC CTGCTTGCTG AGGGGACAGT AGGCCTGCTT 
121 GCTCGCTGGC CTGCTTGCTG AGGGGACAGT AGGCCTGCTT GCTGAGGGGA CAGTAGGCCT 
181 GCTTGCTCGC TGGCCTGCTT GCTGAGGGGA CAGTAGGGCT GCTGGCTTGC TAGTAGGGCT 
241 GCTCGCTGGC CTGCTTGCTT GCTCGCTGGC CTGCTTGCTT GCTCGCTGGC CTGCTTGCTT 
3 01 GCTCGCTGGC CTGCTTGCTT GCTCGCTGGC CTGCTTGCTT GCTCGCTGGC CTGCTTGCTT 

3 61 GCTCGCTGGC CTGCTTGCTT GCTCGCTGGC CTGCTTGCTT GCTCGCTGGC CTGCTTGCTG 
421 AGGGGACAGT AGGCCTGCTT GCTTGCTAAG GGGACGGTAC GCCTGCCTGA TGGCTTGATA 

4 81 GTAGGGCTGC TGGGCTGCTA GTAGGGCTGC TGGGCTGCTA GTAGGGCTGC TGGGCTGCTA 
541 GTAGGGCTGC TGGGCTGCTA GTAGGGCTGC TGGGCTGCTA GTAGGGCTGC TGGGCTGCTA 
601 GTAGGGCTGC TGGGCTGCTA GTAGGGCTGC TGGGCTGCTA GTAGGGCTGC TGGGCTGCTA 
661 GTAGGGCTGC TGGGCTGCTA GTAGGGCTGC CTGCTGGCTT GCTTGCTTGC TTGCTAGTGG 
721 GGCCGCTTGC CTGCTACTAG GGCTGCTGTG CAGCTGGGAG AACAGAGTAG GGCTGCCGGC 
7 81 CAGCTGCGTG CGAGGGCGTC CGAGGGCCAG ACGAGGACAC GGGACCCGGG CCTCTCCCCC 
841 AGGCACAAAG CAGTAGGGCT GGCCAGGGGA AACAGTAAGG CTGATTGCTT GCTGAAAAAC 
901 AGTAGGGCTG CTGGTTTGTT GCTAACGGAA AAAGGGGAGG TGTGTGTATG CTTTTGTGAC 
961 TTTCCAGCGG AGGGGGTACA GTGCACGGCC AAGTTACAAG CACCTGCTTA ACTTGCTTTG 

1021 GCTCTGTGCG GTTTTGTTGC TAGGTACTAG TAGTAACACA TAGTATTTCA TCGCATGGCG 
1081 CCTGCATACG CTCCACAGCA TCCGAAACAC GTTATTTCTA TAGACTAACT TTAGTGTTCT 
1141 TCTTGGTTGG TACCATTTTA CGAAGTTTGC TCCGTTGAAT AATAAGAAAT CCGTTGTGGT 
1201 TACAATACAC CTGCCAACGT ATTGGATGGT TCCTTTTGCC ATCAACCTTT GCTTGCATTC 
1261 TAACATTGAA TTTTCACATT ACACCCTAGT CTTCTCATGC AATTAGGTAT GTCTTTGCTC 
1321 GTGTTTAATT TTCTCTACAC ATTTCTGTGG AAATGTTTGT GTTGGTTTTA TTTATGTTAT 

13 81 TGCAACCTGT ATCGGTGGAG CTTTTGCCTG CAAAATTAAC TTCTGTTCCA ACGTGGTGTC 

14 41 CACCACATCC CGGAGATACC TACTTGCTAA CCTGCCGCGG GACGTCTACG GCCAGAGACC 
1501 AGCGAAGCAC ACAATGGTTT CGCAACAACA CGCTTATGCG TGGGAGTAAT TTCTACGGCA 
1561 GACTGGTATC TGTGACTCCC AATGCTACGA TATCTGACCG GTATGCGTGT CAAACAAAAA 
^621 CAACAACGCG GAGTAACAAC ATCGATTTTC GGGTAAGCTC ATCGCGCCTC ACGCTCCAAG 
1681 AACGGTGCTC TTCATACGGC TATACTTACG CGAATAACAC AAGGGTATTG AGGTGTTACT 
1741 CTGGTGGAAA CGTAACTTTA AGAAACGTTG TCTTTCATTT AAACGGTACA GCGGTCATCA 
1801 ACGGTACTAC AACAAACATA CATACATTTG TGTTAACAGA AAAGACAGGA GGGACGTATT 
lell ?C?GTTCTGC GTTTATTGGG AATGAAAAAT TCTATTCTCA GACAATTAAT GTGTTmTA 
^|21 CTTCATTTAC CTTTAAACCT ACAAACGACA TTCCCAATGA GTCACATTTT AATAAAACTG 
1981 GGCAAATACA ACAAACAGCT AGTGTACAAC ATCCTGAAAA CTACGTTGTG TTCTCTGTTC 
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2 041 CCGTTTTTTC TATTGGCGTT TTAACAGGTA 
2101 TTACAATACG CTGCAACGAG AACTCTGAAT 
2161 GCTACATTCA ACCCTCCCAT AATCAGCGTT 
2 221 ACAGAAATGC TCATCAAGAA GAGAGTATTG 
22 81 CTGATTCTTG CTGTCAATTA GTTTTACTTG 
2 341 AGGAAAACAC AATTAACGAA GTTATGGAAC 
24 01 AACAAACATC ATATGAGGAT AATGTTGAGC 
24 61 ATTTTAATTA CTACAGTGGA CTAATATTGG 
2521 TAGAAAATCA ATATCATGGA TTAATACTGG 
2 581 TAAATGAATT AAACATGATA GAACAATATG 
2 641 TTTTAAATTA AAACATAAAC AACATTAAAG 
27 01 AAAATTTTTT TCATATACCT GAAACTTATA 
2761 TTTGCCCGGT ATCTCTGATA ACATGGTGTA 
2 821 ATCACAATCA AAACTCTCCA TAATGCGCGT 
2 881 AAGAACACTC TCATAAACAG ATTTTCCTCC 

2 941 TAGTTTGTAT TGTCTATAGA AATTAAACGC 

3 001 TCCGTGCGGT GGTTCACGTA ATTCTCTGCT 
3 0 61 GCGCTTCTTT TCAGGGATGG AAAACCATGT 
3121 ACCTACAACC GATGGCGTAG ACGTCATTTT 
3181 CGGCCAAGGC ATTGTTCCGT TTTTACCAAT 
3 241 GTTAACTGTA ATGTCCATGG TTTTAGTTGC 
3 3 01 CAATTATAAA CAAATAACCC ACGTGACATG 
3 3 61 TAGTTAATCA TATTCGCTTT CACATAATGG 
3421 CCTACCTTGT TTGGATGGAT TTATGGTTAC 
34 81 TTTTACGTAA AACATATTGT AATTTAATCA 
3 541 ATGTTTGCAT AAAAGCACTT ATGTTAGTGA 
3 6 01 TTAAAGAGGC ATGTGAATAT ATTTAACACG 
3661 ATAACTGGTA AAAACATGAC GTTTAAACTT 
3721 TACGTCCACT GCGATGAAAA CTGTAAACCT 
37 81 AACACAGAAA AGGACTTATA TAGTGTTGGA 
3 841 TATGTTACAA ATACAAAAAT AATAACAACA 
3 901 CCAAATTTTC CATGCGACAG AAAAAGATGT 

3 961 GTGCACATTC ACGGGGGAGA TAATGCCTTA 

4 021 AATGAAGGTT ATGATTTAAT TGGTAGTAAT 
4 081 AACGTAAATT GGGATTCAAA TGAACCAGTC 
4141 GCAGTGGAAC ATGGGGACTA CCTACCTAAC 
4 2 01 ACATTTAAAT GTTCATTGTC GTATACACTC 
4 261 AACAAAAAGT GGTCAAACTC TTTCCCAACG 
4 321 ATAGACAATG GGTACATAGA CATTGGCTTG 

43 81 ACTGTAAAGT GTAGCGACGG GTACAACATT 

44 41 ACAACTTGGG TTCCACCATT ACCTAAGTGT 
4 501 ATGCCAGAAA CACCCATGCC AGAAACACCC 
4 561 ACCGCTAAAA CTGCAACAAC ACCAAATGCG 
4 621 GACGACGTTA CTTGTGTAAA GCCTCATTTT 
4 681 AAGGAAAAAT ACAGTGTTGG TGCGAGCGTT 
4 741 ATGCAGTCTA CAGTTTCTGT TGAATGTTTG 
4 801 AAGTGTCATA GAAAAAAATG TCCAACCCCT 
4 861 ACAAGCGGAG AAGATGCTTT TAAGTACGGA 
4 921 TATCAACTTT TAGGAAGTAT GGTGCGGATT 
4 981 GACTGGGAGC CAAAAGCGCC TATATGTGAT 
5041 ACAAACGGAA AATACCATCC GGTGAAAGAC 
5101 TCGTGCAATC GTGACTTTTC TTTAGTTGGA 
5161 TGGAATAAAC CGTTTCCAAG ATGTGAACAA 
5221 GGAAAGCTGC TAACAGGTTC TTCAAGCGTT 
52 81 TGTGAAACTG GATTTACTCT AATTGGCAGT 
5341 GATCCACCAC TTCCTACGTG CGTGCCAGCT 
54 01 GAAACCAAAA AACCAAACAC GCCAACGCCA 



TTGCAATATC GTTGATTATG TGTTGGTTAT 
CATCAACTAA TAGTTATGCA AGCCAGACAA 
CCAATACTAA TGAATGTAGT CGCCATACCT 
AAGAACTACC AAACCAACAC ACAAGTGAAA 
AAGTGAAAAA TGTAGCCTAC GATGGACCGC 
AGTATGATGA TGTGGTTGTA AAAAATATAG 
ACATGGACTA TAGTGATACT ATAAATCCCA 
AAGAAGTAGA TGAAGTTTTT TACAATGAAC 
AGAATTTAGA TCACAATGAG TACAATCATT 
ATTGGTTAGA ATAAATAACT TGTGGGTTAT 
CAGCATTTTT GTGTAAATCC TTTATTTATT 
TTTAATTCCA TTTTCCTCAA AGTTTGTATC 
TTCGGTAAAA TTAATTGATG GAAAAAATAC 
AATATATAGT TTTAAGGGAC ACTTATAGTT 
AATAACCCAG ACCGTGTTTA ATTGTTCTTT 
ATCGTCTAGC GTTCTCGCTA GAAAGTGTGC 
CAAAATAATA TTAATTCTGT TCACCAGAGG 
TCTTTTCCCC ATAATAACCA CATTCTTTTC 
CTGAAAATAC ATCATTTCGT TTCTAAGGTA 
TCCTAATTGT TCATCAACTG CAACGATGCA 
CACTGACGGG TTTAACACAG AAGTATTTCA 
TACTTACTAA TGTAAGTACG TAACGTGATA 
ACAAAACGAA AAAATGTTAC CTGTGTTAAG 
AATGAAAAAT AAAAATATAT ATATAACGTT 
CGTGATGTTT ATGTTGGCTA CTAACACACA 
GCTTCAATCT TTATATTTTA TTCAT^TTGT 
CATTCAGAAA TAGGCGCAAC GCTGTGCCGT 
TTTCCTCTGT TTTTATTACA CGCCATAATG 
CCACATTTCA CGGAATATCG CGTCAAGTCT 
GAAACAGCTG AATTAATTTG TCGTCCTGGT 
GAATGTTTAC AAAATGGTAC GTGGTCAAGA 
CCCACACCTG CTGACTTGCT GAATGGAGCC 
AAATTTGGAT CCAATATTTC CTATGAGTGT 
GTTCGTTTTT GTATTTTACA AGACACAGAA 
TGTGAAATTC AGAAATGTAT TAAACCACCG 
CAAGATGTTT ATAACTATGG AGATGCAATT 
GTTGGATCAA CAACATTAGT ATGCACGTCA 
TGTTTAATGC TCGTATGTGA AAGTCCACAA 
TCCAGAAGAT ACAACCATGG ACAATCAATT 
GTTGGGCCTG AAACATTAAC GTGCACAAAC 
GTATTAGTTA CAAATAACCC AAGCACACCC 
ACGCCAGATT ATCAAAAAAT AAATTTGTCA 
TTTGTTACAA CTGTTGTTTC TCCAGAGAAA 
GAGCGATTCA TGGTAAAGGC TGAAAATGAC 
GAGCTAATAT GTCGACCAGG ATTTACTAAA 
TCCAACGGAA CATGGACTGC TCCAAATGCC 
CAAGAACTTT TAAACGGAGA GTATATAGTT 
ACAAATATAA CATATAAATG TAATGAAGGT 
TGTATGCTTA AAGACGATTT AAAAACAGTT 
ATTGAAAAAT GTAAGCCACC GCCACAAATT 
TTTTATCAGT ATTTGGACAC CGTAACATTT 
GATGAAATGA CAACGTGTAT AAGTAATACG 
ATCACTTGCA GCGCTCCTAA TATTGCACAC 
TACAAATACG GTCAATCTGT TACCATTGGT 
GAAATTTCTA CATGCAAGGA TTCATCGTGG 
GTTTCAATGC CTTCTGACAC ACCTAAACCA 
GAAGCACCCA AACCAAACAC CCCAAACGTT 



7 



o ^fl 3'i J.. Q O n=J Q 23 O 2 O J.. 



<x<»i> rOJ.. Iltnt 



WO 00/28040 



PCT/US99/26260 



54 61 GGAACACATA CACCATTCAA ACCACCACCA CAAAATCCAC CAATAGCACC CCCAATGAGT 
5 521 AAATGGAAAA GGCATGTCGT GTTAGTTCTT TTTGCAAGTG TCGCGTCCTT GTTATTCGTA 
5581 CTTGCTGCCC TTTATTGTTG TTTTCTAAAA TAACTGTTTT TTGTCTTCAG CAGGTTCGCC 
5 641 AGGCAAACTC GCACGCATTA ACCAATCTGC CAACCGCCGT TGATTCCGGA ATTAAGTTTA 
57 01 CATTATTCAA GGTTGCCAAT AAAGGTGGTT TAAAAATATT CTATTGGTGT TCATTGTTTT 
57 61 ATGTTGACCC GTTTATAGTT ATCGCGCCAC CTTGTGGCTA CATTATATAG CACGATCACT 

5 821 TTCCACGTTA TACTTTCACG TACTATGACT CATACGCCTT AACGTCACGT GGCGTGCGAT 
5881 TGTGGCCGGG GCTGAAAATA ACACAAGGGG TACATAATCC ATCCAGGCGG CACACATTAG 
5941 ACACGGTTTA TAAAACTATA TCGGATGCGC CAACAATCAC TGTCGCTAGC GACACTGATA 

6 001 GAAAAACATT TTAACGTTTG TTTAGCGAAC TTGAATAACA CATAATGGCT TCCAAAGGCA 
6061 ACGCCGGACA ACCCCTGGAA GATAATCAGG GGTCTCGTGC CCCGATAGGT GCGTGCGGAT 
6121 ACGTGTACGC GTATTCGAAA CAAGACTTTC CCTTTGCCGA GGCGTCCATA CTCGGCAACA 
6181 GACCATCTGG ATCTGGCGTT TTCTCGCTAC CAATCCTTTA CGGACTTACA GTTGAACACG 
6241 AATTCCCTCT CACCGTAAAA GCCGCATACA AAAAAGTTGA CACCACGACG CTCGCCGTTA 

63 01 AGGTGACGTG CTTTCACAGA GAGGTTATTG TGTTTCACAA TGCAAGTTTA TTCAGGCCGG 
6361 TGTTTGACGG TACCGGTCTT AACGAACTAT GCGAGGAAGC CAGGGCTCTC TTTGGGTACA 
6421 CGCAGTTTAT AGAACCGGGT CCACCTCACA GCATATGGAA CCCTCTGGAA TGTCCGCAGT 

64 81 TACCGGACAA GGATGAGATG TTTCTTGGCG TTGTTGTTAC GGAAGGGTTT AAGGAAAGAC 
6541 TGTGGAGGGG CTGTCTCGTT CCCGCGGTGT TCCAGACCCA GCAGGTGCAG ATTGCCGGAC 
6601 GCCAGGCGTT TAAAGTGCCG TTGTACGACG AAGACCTGTT TGCACCTCAC GGTCATAGAA 
6661 TGCCAAGGTT TTACCATAAA GACGTTAGCG CGTACCTCTA CGACTCCCTC TTTACCAGCA 
6721 TCGCCCAGGC CCTGAGACTC AAAGACGTGA CGGCGGTCAT CCACGCCACA GAAAAGCAAT 
67 81 TCATGCAGGA CCATTACAAA ATTGCCAAGA TAGTGCAGGC AAAACAGTTT TCAACGACGC 
6 841 TGCCGAAAAC GACAGACGGG TCGTCCCACA TGATTGTGGA CAGCGTCGTC GCCGAGCTCG 
6 901 CCCTTAGTTA CGGCTGTATG TTTCTCGAGT GTCCCCAGGA CGCGTGCGAG TTGCTGAACT 

6 961 ACGATAGCTG GCCCATATTT GATGGTTGTG ACTCACCAGA GGCTAGGGTT AACGCGTTAG 

7 021 AGCGCTGGTC GGCCGAACAG GCCGTTCACG TGGCGGGTCA ACTGTTCGCT GCCAATTCGG 
7081 TGCTGTACCT AACTAAAGTG CAGAAGCAAG CGCCCAGGGG ACAAAAGGGA GACGTAAACG 
7141 TGTACAACTC CTTTTTCCTC CAACACGGAC TGGGGTTTTT AAATGAGGCC ACGATCAAGG 
72 01 AAAACGGCAG CGAAGCCTTT AAGGGCGTAC CCTCAAACGC CCTCGATGGT TCTTCGTTCA 

72 61 CGCCGTATCA CCTGGCCTAC GCCGCGTCTT TCTCGCCCCA TCTGCTGGCG AAGTTATGTT 

73 21 ATTACATGCA GTTCTTGCAA CACCACAAAA GCTCCACGAA CCAGGCGTTT AACATGGTCC 

73 81 ATTATGTCGG CACCGCCGCC AACTCAGAGA TGTGCACGCT ATGTCACGGC AACACGCCGG 

74 41 CAACGTGCCT CAACACGCTG TTCTATAGAC TGAAGGATAG GTTTCCCGCC GTAACCACCC 

75 01 CTCAGCGCAG GGACCCCTAC GTGGTGACCG GAACAGCCGG GACCTTTAAC GACCTGGAGA 

75 61 TTCTGGGCAA CTTCGCGAGC TTTAGAGACC GCGAAGAGGA CGGAAACCCG GCCGACGAGC 
7 621 ACCCAAAGTA CACGTACTGG CAGCTATGTC AGACCGTGAC AGAAAAGCTA TCCGCGATTG 

76 81 GAATCACCGA AGACCACGAT AATCACGTGA ACCTCATCAC CAACATCCAA AGTTTTCTCA 
7 741 GGGTGTTCAA GGGTATCGAC TCAATTGTGG ACGGAGAGGT CATGAAGTTC GTTAATTCGA 
78 01 TGATTAAAAA TAACTTTAAT TTCCGCGAGC ACGTCAAATC GGTCCATCAC ATACTCCAGT 
7 861 TCTGCTGCAA CGTGTATTGG CAGGCGCCGT GCGCGGTGTT CCTGAATCTG TACTACAAAT 
7 921 CCCTGCTGTG GATCATTCAG GATATCTGCC TGCCGTACTG CATGATCTAC GAACAAGATA 
7 981 ATCCGGCGAT GGGCATCCTC CCCTCCGAGT GGCTAAAGAT GCATTTTCAG ACGTTGTGGA 
8041 CGAACTTTAA AGCGGCGTGT CTCGACCGCG GGGTCCTCAC GGGGTGCGAA CTGAAAATCG 
8101 TACACCGGGA CATGTTCTGC GACTTCTTCG ACACCGACGC GGGGTCCAAC GGCTTAATGG 
8161 CCCCCTTTAA AATGCAGGTT AGAATAGCCC GAGCCATGAT GGTCGTTCCG AAATCAATTA 
8221 AAATAAAAAA TAGAATCATT TTTTCCAACA CCGCGGGATC CGAGGCGGTG CAGTCGGGGT 
82 81 TCGTCAAACC GACGGGAACC AGGGACACTT ACGTGGTGGC CGGACCGTAC ATGAAGTTTC 
8341 TCAACTCGCT GCATCGCGCG CTGTTCCCCG ACACCAAGAC CGCCGCGCTG TACCTGTGGC 
84 01 ACAAGATCTC CCAGACCAAC AAAACCCCAG TTCTGAAAGA CGTCCCGGAC GACGAGCTGG 

84 61 CGGAGCTGGT GTCGTACGTA AAGACCAACA GCCTCGCGTT CGAGGAAACG AACGTGCTGG 
8521 ACGTGGTTCC GGATTCACTC ATGTCGTACG CGAGGATCAA ACTGAACGGG GCCATTCTAA 

85 81 GGGCATGTGG CCAGATTCAG TTCTACGCCA CGAGGCTGCA CTGCCTCACG CCGGTGCTAC 
8641 AGACGATCGA TGCCGAGGAA TACCCCCACG TGCTGGGCTC CGCGGCAATC GCCACACCGG 
8701 TGGCTTACCT GGCAGAAATA CGCGGCCGCA CCGCCCTCAC CGTCCAGACG ACGGCGCGTC 
8761 AGCCGGTCGC CGCCACAGGG CGCCTGCGTC CCGTGATAAC CGTTCCCATG GTAGTCAACA 
8821 AATACACGGG GGTCAACGGG AACAACAACG TTTTCCACTG CGGAAACCTG GGGTACTTCG 
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8 881 CGGGGCGCGG CGTGGACCGC AACCTGTGGC CGGAAAGCTC CCCCTTTAAG AAAACGGGCG 
8941 TCAGCGCCAT GCTAAGAAAG AGACACGTCA TGATGACCCC CATTATCGAC CGCCTAATAA 
9001 AGCGAGCCGC GGGACAGACA ATCAGCACGT TCGAGGCGGA AAGCGTTAAA AGGAGCGTGC 
90 61 AGGCGCTGTT AGAGGATAAG GACAACCCTA ACCTATTGAA GTCGGTAATC TTGGAGCTTA 
9121 TACGACACCT GGGGAAGGGC TGCCAGGACT TAAGCTCCGA GGACGTGCAA TATTACCTCG 
9181 GTGACTATTG TATGTTGACG GACGAGGTTT TATTTACGTT GGATAATATA GCACAGTCAG 
9241 GCGTGCCGTG GACTATCGAG GACGCGGGTG CCCTAATAGA GGATCGCCAG GACGCAGACG 
9301 ATCTTCAGTT CGTAGACAGC GACGATATCG CCACCGCTTC CTGTCAGCCC CCCGAGGAAC 
9361 AGCTACCGAC CCCTAGCGCC GGCGCCCTAC TGGCCGGGAA GAAGCGAAAA ATTAACGCGC 
9421 TGCTGAGCGA TCTAGACCTT TAGGAAAACC GTGGCAGGCG GGCAACAATG GCCAGGGAAC 
94 81 TCGCAGCATT ATACGCGCAG CTGTCGGCCC TCGCCGTCGA CTTGAGTCTG GTTATCTTTG 
9541 CGGACCCGCG AAGTATCGAC GGTGCCCGCA TTCTAAAAAC AAAAACACAG ATAGAGAACC 
9601 TGAACCGCGA CCTTCTGCCG CTGCTACGCG AGCAAAACTC GGTAGAGACG TCCAGCCTGT 
9661 CGCTCGAAGT GGAGCACCTG GCCAAAAACA TCGAGGACAA ACTCGGCGAG CTGGAGCGCA 
9721 GTCTGCGGCA GAGATATTCG AGCCGAGAGC ATTTTGAAAC ACTACACCTG AGACCCGAAT 
9781 GTCACTATCA CTCTACGGTT ACTTTTCAGT TTTACGGGGG CGGGTTAATA GATGTAAACA 
9841 TGTGCCTAAT AAACGATGTA GAACTGCTGT GTAAAAGACT AGGGAGTGTG TTTTATTGCA 
9901 TCGGTGCAAA CGAAGCTCTG TCCGGATTGA ACCGGGTTCT GACGTTTCTG TCAACACTGC 
9961 GGGGTATCTC CCCGATCCCG CACCCAGACC TATACGTCAC GTCAGTGCCT TGCGTACAGT 
10 021 GCCTGAGGGA AATCGAACTC GTACCAAATC AGGGGTCCAG TTTACTCGCG GTGTTGGCAG 
10081 ACCGACACTG CGATCACCTC TGTAAGAAGG TTAGGGCGGA GCCAATACAC GGCCTGTTTG 
10141 AGACAGAACT GAGCCAGCTG GGTCTAAAAG TAACAAAACG TTCGGACGCC ACGCAGCACG 
10201 GCGTCCGGTC CTCTGCAGAT CAGTTAAGGG AGTCGTCGCT GGCGGCCATA CAAGATCACA 
10261 ATATATTCAA ACGGGTGTCC GCGTCAATCA TGGAACTATC CAATCTAATT TATTGGAACG 
10321 CCGGGCAAAC CGGCCTCCAG ACCGGGACCG AAAACGAGTG CTCACAAATG GCCAGACTGC 
10381 TAACACACGA GGCCGATATG CACGAGCACC GTGCGCTAAT AACACCCAAA CTAAGCGCGA 
10441 CTCACTTCTA CGACTGTTTC CGACCGGATC CCATAGAATC CCTGTTCTGC GGCGGTCTTT 
10501 TTAACTCTAT AGACGACACC ATAAACGCAC TGAGCCGGGA TTGCTCCGTG ACGTTCTTTC 
10561 AACAGGCAAA CTATACCAAC GTTATGCGAA AACAAAACGA GCTGTTCACC AGACTCAATA 
10621 GCATCCTGCG TCAGGGGAGC GCGGGATCGC AAAAACCGGC CACCCCCTCG GAGCCACGGA 
10681 CCACCACCGT GGCGGCAACC GCGGCAAGCG ACGTCATCAA AGACGCACAG TATCGCAAAG 
10741 AACAGTACAT GAAAAAGGTG GCCAGGGACG GCTTTAAAAA ACTAACAGAG TGTCTGCAGA 
10801 CGCAGAGCGC GGTGTTGGCA AACGCACTCT GCATGCGCGT ATGGGGGGGC GTCGCCTACG 
10861 GCGAGGCGTC CGAGCTGGTG AACCATTTTC TCCTCAGGCG GCGCTTCGTC GCGCTTCCCT 
10921 GGGAGGCGCG CTGCCGCTCG GATCAGATTT TATTCGAAAA CTCAAAGTAC ATTAAAAACT 
10981 CACTATATTC CCAGCGCCTC AGTCGCGAAC ACGTAGAGAT TATCACGCTG CAGTTTTACG 
11041 GCCTGATAAC CGGCCCCCTG ACGCGCCAGA GCGATCTCTT TCCCGGCCCC GCCAACGTCG 
11101 CGCTGGCCCA GTGTTTCGAG GCGGCCGGAA TGCTTCCGCA TCACAAGATG CTGGTGTCAG 
11161 AGATGATATG GCCCCAGATT CAACCGAAAG ACTGGATAGA CCAGACATTT AATCGTTTTT 
11221 ACCAACTTCC CGAAGGTGAT CTCAACGCGG TACAAAAGTC CGCCTGGTGC TTTATACGAG 
11281 AGCTCGTCCT CTCGGTGGCG CTTTATAATC GCACGTGGGA AAAGACGCTG CGGATATTTT 
11341 CCCTAGCGCG CGAGAAACTC TCCATCTCCA ACCTAGACGT TAAAGGCCTG ACGTCCGGCC 
114 01 TGTATCTAAC GTACGAGCAA GACGCGCCGC TCGTTCTAAT TTCTCAAAAT ACCGGCTGGA 

114 61 TATTTAAAGA CCTGTACGCT CTTCTGTACC ATCACCTGCA ACTGTCCGAC GGCCATGATG 
11521 ATAACTAACC GAACGCGTCG TCTCCTGCGG GCGTGGGTCG TGATAATCGC GATCGGCACG 

115 81 GCGGTTGGCG AAAACGTCAC CACCCCCAAG GGCGCGACCA CCACCGCGAA GCCAACGCCG 
11641 GGCCCGTCGA CGCCCACACC TCCCGAGAAC CCACCTAGGG CCGAGGCGTT TAAGTTTCGC 
117 01 GTGTGCAGCG CCTCGGCCAC CGGCGAACTC TTCAGGTTTA ACCTGGAAAA AACGTGTCCG 
117 61 GGCACCGAGG ACAAGACGCA CCAAGAAGGC ATCCTGATGG TGTTTAAAAA AAATATTGTC 
11B21 CCGCACATCT TTAAGGTCAG ACGGTACCGC AAGGTGGCCA CCTCGGTGAC CGTCTATCGA 
Asll GGGTGGACCG AGACCGCCGT GACCGGCAAG CAAGAGGTCA TCCGACCGGT GCCGCAGTAC 
11941 GAGATCAACC ACATGGACAC GACCTACCAG TGTTTCAGCT GCATGCGCGT AAACGTCAAC 
^2001 GGCATAGTAA ACACCTACAC GGACAGGGAC TTCACTAACC AGACCGTGTT TCTGCAACCG 
12 061 GTCGAGGGGC TCACGGATAA CATCCAGCGA TACTTCAGTC AGCCGGTGCT GTACACGACA 
12121 CCGGGATGGT TTCCGGGAAT TTACAGGGTC AGAACCACGG TCAACTGCGA GATCGTGGAC 
^2^8^ ATGATCGCGC GTTCGGCGGA ACCGTACTCG TATTTTGTCA CCGCCCTGGG AGACACGGTA 
lllll GAGGTGTCCC CGTTCTGCCA CAACGACTCA ACGTGCTCGG TCGCGGAGAA AACCGAAAAC 
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123 01 GGCCTCGGCG CCCGCGTGCT CACAAATTAC ACCATCGTCG ACTTCGCGAC CCGCCAGCCC 
12 3 61 ACCACCGAAA CGCGGGTCTT CGCCGACTCG GGAGAATACA CCGTATCGTG GAAGGCGGAG 

124 21 GACCCCAAGT CGGCGGTCTG CGCGCTGACG CTCTGGAAAA CCTTCCCCAG GGCGATACAG 
124 81 ACGACGCACG AGGCCAGCTA CCACTTCGTG GCCAACGACG TGACGGCGAC CTTCACGTCC 
12 541 CCGCTCTCCC AGGTAACTAA CTTCACGGGC ACGTACCCCT GCCTCAATGA TGTTATTCAG 
12 601 AAAACCCTCA ACGCCACCAT CAAGAAGCTG TCCGATACCC ACGCAACAAA CGGATCGGAG 
12661 C2^.GTACTACG AAACCGAGGG GGGTCTGTTT CTCCTGTGGC AGCCGTTAAC GCCGCTAAGC 
12721 C'^^^GCTGACG AGATGCGCGA ATTAAACGGC ACCACGCCAG CACCCCCCAC CACAACCTCA 
127 81 ACCGCCAACC GCGTTCGAAG AAGCGTCGGT ACGAACGAGC AGGCAACGGA CGACCTAGCG 
12 841 GCGCCCCAGC TGCAGTTCGC CTACGACAAG CTCCGCGCGA GCATCAACAA GGTGCTGGAG 
12 901 GAGCTCTCCA GGGCGTGGTG CCGAGAACAG GTGAGGGACA CCTACATGTG GTACGAACTG 

12 961 AGCAAGATTA ACCCCACCAG CGTAATGACG GCGATATACG GGCGGCCGGT GTCGGCCAAG 
13021 TTCGTGGGCG ACGCCATCTC CGTGACGGAC TGCGTGGCGG TGGACCAGGC GTCCGTCAGC 
13081 ATCCACAAGA GCCTCCGCAC GTCCACCCCG GGGATCTGCT ACTCGCGCCC CCCGGTCACG 
13141 TTCAGGTTCC TCAACAGCAC CACGCTGTTC AAGGGCCAGC TGGGACCCAG AAACGAGATC 
132 01 ATACTGACGG ACAACCAGGT GGAGGCGTGC AAAGAGACGT GCGAACACTA CTTCATAGCG 
132 61 AGCAACGTAA CCTACTACTA CAAAGACTAC GTCTTCGTGA AAAAAATTAA CACCTCCGAG 
13321 ATATCCACCC TCGGTACGTT CATCGCCCTG AACCTGTCGT TTATAGAGAA CATAGATTTC 
13381 AGGGTCATCG AGCTGTACAG CCGCGCGGAG AAAAAGCTGT CCGGGAGCGT TTTCGATATA 
13441 GAAACCATGT TCAGGGAATA CAACTACTAC ACGCAACGCC TGGCGGGACT CCGGGAGGAC 
13501 CTGGACAACA CGATCGACCT GAACCGCGAC CGCCTGGCCC GCGACCTGTC CGAGATAGTC 
13561 GCGGACCTGG GCGATGTCGG CCGCACGGTC GTTAACGTGG CCAGTAGCGT GATAACCCTG 
13621 TTCGGATCAA TCGTGAGCGG GTTCATTAAC TTTATAAAGA GTCCGTTCGG GGGCATGCTC 
13681 ATGATCCTGG TGATTGTGGC GGTCGTCCTG ATCGTGTTTG CGCTAAACCG GCGCACCAAC 
13741 GCCATCGCCC AGGCCCCCAT CAGGATGATC TACCCCGACA TAGACAAAAT GCAGCCCTCT 

13 8 01 GGCGGTAAAG TCGACCAGGA GCAGATTAAA AACATTCTCG CCGGCATGCA CCAGCTACAG 
13861 CAGGAAGAGC GTAGGCGGTT AGACGAACAG CAGAGGTCAG CGCCCTCGCT TTTCCGGCGC 
13 921 GCGTCAGACG GACTA7VAACG TCGCTTTAGG GGATATAAAC CGCTGGAAAA CGAAGAGGCT 

13 981 CAAGAGTATG AAATGAGCAA ATAACCACAC CCACACGCCT GTACTTGCCG CCCGCCAGAG 

14 041 CCGCGCGGCC AATCGATTCG CGCACCCGGC CGGTCCCCGA CACCCTCTAG CGCCCCCGGG 
14101 CGTCCGCGGT CTGTATCAAT CATGGATTTC TTTAACCCGT ACCTGGGCCC TCGCGGACCA 
14161 CGCCCCCACT CACACAGAGG CACCGATGCT CCCGCCCCTG CCGGCGCCGG AGCCGTTCAG 
14 221 CCGCCACCAG ACGTTTGCAG GCTCATCCCC GCCTGCCTCC GAACGCCAGG GGCAGGCGGG 
142 81 ATGATCCCGG TCACGATCCC GTTCCCGCCA ACGTACTTCG AGAACGGTGC TCGCGGAGAC 
14 341 GTGCTGCTCG CCAACGAACG GTCCATGTGG ACGGCGCGCG ACCGCAAGCC CGTCGCCCCG 
144 01 GACCCCCAAG ACCAATCCAT CACGTTTCAC GCGTACGACG TCGTTGAAAC AACGTACGCG 
144 61 GCGGACAGGT GTGCCGAGGT ACCTAGCCGC TTCCAAACGG ACATTATCCC AAGCGGAACC 
14 521 GTGCTCAAGC TCCTGGGGCG T^CCGAGGAC GGCACCAGCG TGTGCGTGAA CGTGTTCCGT 
14581 CAACAGGTAT ATTTCTACGC GAAGGTTCCA GCCGGCATTA ACGTCACCCA CATCCTCCAG 
14 641 CAGGCCCTCA AGAACACAGC CGGCCGTGCC GCGTGCGGCT TCTCGACCAG AAGAGTGAAC 
14 7 01 AAAAGAATTC TCAAAACGTA CGACGTCGCG GAGCATCCCG TCACGGAAAT CACGCTATCG 
14761 TCCGGTTCCA TGCTCTCGAC CCTCAGCGAC CGCCTCGTCG CGTGCGGGTG CGAGGTGTTC 
14 821 GAGTCAAACG TGGACGCCGT TCGCCGGTTC GTTCTGGATC ACGGGTTTAC CACGTTCGGG 
14 881 TGGTACTCGT GCGCGCGCGC CACGCCCCGC CTGGCGGCCA GAGATGCCAG GACGGCCCTG 

14 941 GAGTTTGACT GCAGCTGGGA GGACCTCAGC GTTCAAGCGG ACCGCAGCGA CTGGCCCCCG 

15 001 TACCGCATCG TGGCCTTTGA TATCGAGTGC ACTGGAGAGG CGGGATTTCC GTGCGCCACG 
15061 CGCGACGGCG ACGCGGTGAT CCAGATCTCC TGCGTCTTCT ACACGACCAG GGAAGGCGCG 
15121 CCCAATCCGC CAAACATACT GTTCAGCGTC GGGACGTGCG ACCCCATCCC GGACACCGAC 
15181 GTTTTGGAGT TTCCGTCGGA ATATGACATG CTGGTGTCGT TCTTCGCCAT GATCCGCGAC 
15241 TTCGAGGTGG ACTTTTTAAC CGGCTATAAC ATCTCAAACT TCGATCTCCC GTACCTAATC 
15301 ACGCGAGCGT CCCAGGTGTA CAACCTTCGA TTAAACGAAT ACACAAAAAT AAAAACCGGC 
15361 TCCATCTTTG AAGTTCACGA GCCCCGTGGC GGGGGAGGGG GGTTCATGAG GTCGGTCTCA 
15421 AAAATTAAAA TAGCGGGCAT CGTCCCCATA GACATGTACC AGGTGTGTCG CGAAAAGCTC 
154 81 AGCCTCTCCG ACTACAAACT GGACACGGTG GCCAGGCAGT GTCTGGGTGG GAAAAAAGAG 
15541 GACGTATCGT ACAAGGACAT TCCCCCTCTG TTTCGCTCAG GTCCGGGCGG CAGGGCTAAG 
15601 GTGGGCAGCT ATTGCGTGAT GGACTCGGTC CTGGTGATGG ACCTCTTAAA AATGTTTATG 
15661 ATACACGTGG AGATTTCGGA GATAGCCAAG CTGGCCAAGA TTCAGGCCAG GCGCGTCCTG 
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15721 ACGGACGGCC AACAGCTCCG CGTGTTCTCC TGCCTGCTGG AGGCCGCGGC CAGGGAGAAC 
15781 TTTATCCTCC CGGTTCCAAC GCCCGAGGGA CAGGGGGGCT ATCAGGGCGC GACGGTGATC 
15841 AACCCCATTC CGGGGTTTTA CGACGAGCCG GTCCTGGTGG TCGATTTTGC CAGCCTGTAC 
15901 CCGAGCATCA TCCAGGCGCA CAACCTGTGC TACTCCACCA TGATACACGG ACGAGACCTG 
15 961 CACCTGCACC CCAACCTGAC GCCGGACGAC TACGAGACGT TCGTGCTGAG CGGCGGACCG 
16021 GTACATTTTG TAAAAAAACA CAAGCGGGAG TCTCTGCTGG GAAGACTGCT AACCGTGTGG 
16081 TTAGAAAAGC GAAGGGCGAT CCGGCGCACC CTGGCGGCGT GCGATGACCC GTCGCTAAAA 
16141 ACCATCTTAG ATAAACAACA GCTGGCCATC AAGGTGACAT GTAACGCGGT TTACGGGTTC 
16201 ACCGGGGTGG CCAGCGGCCT CCTCCCATGC ATTAACATAG CGGAAACCGT GACGCTCCGG 
16261 GGGCGCACGA TGCTGGAGAT GTCAAAGTCT TACGTGGAGG CCCTG.ACGAC GGAAGACCTG 
16321 CGAACGCGTC TCGGTCGCGA GGTGACCGCC CGTCACGGCG CGCGGTTTCG CGTCGTCTAC 
16381 GGTGACACCG ACTCCCTCTT TATCGCGTGC GACGGTTATT CCGCGGAAGC CGTTTCCGCT 
16441 TTCTGTGACG ATCTGGCCGC CAGGATCACT GCGGACCTGT TCCCCCCACC CATTAAGCTA 
16501 GAGGCGGAAA AGACGTTCAA GTGTCTGCTG CTGCTGACGA AAAAGCGCTA CATCGGGGTC 
16561 CTATTGAACG ACAAAATGGT CATGAAAGGG GTCGACCTCA TTCGCAAAAC GGCCTGCAAG 
16621 TTTGTCCAGG AGCGATGCCG CGCCATCCTG GACCTGGTGC TCCACGATCC GGAGGTCAAG 
16681 GCTGCGGCGC GGCTGTTGTG CAAGCGGCCG CCGCACGCGG TATACGAGGA GGGGCTGCCG 
16741 GCTGGCTTTA TAAAAATCGT AGAGGTCCTC AACGCGAGCT ATCTGGACCT CCGAAACAGC 
16801 GTCGTGCCCA TCGAGCAGTT AACGTTCTCC ACCGAGCTCA GCCGCCCCGT CTGCGATTAC 
16861 AAGACCACCA ACCTGCCCCA CCTGGCGGTG TACCAAAAGC TGGCGAGCAG GTGCGAGGAG 
16921 CTGCCCCAGG TGCACGATAG AATCCCCTAC GTGTTCGTTG ACGCGCCCGG GTCCCTAAAG 
16981 TCGGACCTGG CCGAACACCC GGATTACGTC AGACAGCACC AGATTCCCGT CGCGGTCGAC 
17 041 CTATATTTCG ACAAACTGGT GCACGGCGCG GCCAACATCC TCCAGTGTCT GTTCGGCAAC 
17101 AACGCGGACA CCACGGTGGC CATCCTCTAC AATTTTCTCA ACGTCCCGTA TAAGCTGTTC 
17161 TCGTGAACGC CAATTGGAGA ACGCCAACAT AAGACGCCGC GCCAGCGGAG TCCGCAGGGA 
17221 GAGCTCGAGC GGCGAGGAGC GACCAACGGA GACCGCCACC ATGCTGGTTA ACGAACTGTC 
172 81 GGTGGTCCTC GGCGACTGGG AGGTGACTTT TCACCGGGGT AGATTCAGCT TCGTCAACCT 
17341 CACCCGCCTG CAAACGTTCA AGGGCCACGG GGGCTACGCC AGGGTCCGAC TCCCCTTCTC 
174 01 GCTCGACCAG TTACTCCACC AACATTTCGC GTTCGGACTC GTGACGCGTC TCAAGGAACT 
174 61 GCCCCCCTTC TCCGACTGCG TGGCCCTTAT CGCCCCGTTG GATTCCGGCG GCGACGCGGA 
17521 CGCGGCGCGC GTGGCCCCCG GGTTCGTGCT GGACTCCTCT CGCCCGCTGA CCGTGTGGGT 
17581 AAACGCGAGC GGGCGGCACA CGATCCGGTT CTGCCTCCTC TTTCTAAAGC CGATCGACCT 
17 641 GGAGCGCGCG GTCACGTACG TCTTCGGCGA GAACGGCGGC GCGCGCTCGG AGGGCACCCC 
17701 AAAGCCCACC TGCGCGACCG AAAGCCTGCC CGGTGGACCC CTGCGCGTCT CCGGCGAGGC 
17761 GTCTCAGACG TCGCCCCATT CTTTCGTTGC GTATTTTCCC ACGGCCAACT CGGTGGCCTG 
17 821 CCTAAGCCTG TTGCGGTTAC AGGTGAGGCC GTTTTCGGAT GACGCGGCGC ACAGGGACGC 
17 881 GCGGATCTCC CCGAAATACG TCACGTTTAG TAACTCCGGG GGTAACGTCT GCAAGGCGTC 

17 941 CGTTCACACG CTGTCCCCGT CGCGGTGTAA AACGGCGCAA ATGGAAATCA TCTACGCTCC 

18 001 CGGGGACCCC AACGCCGAGA TAGTCCTGGG CCAGTCCGGA CCCGTCCTGC CCACCCACAC 
18 061 CGGCGGCCGC GTATTGGGGG TCTACGCCGA CGCCGAAAAA ACCATCCAAC CTGGAAGCTC 
18121 CGCGGAAGTC CGGGTTCAGT TAATCTTCCA ACAGGGAGCG GCCGCTCGGG GCGATCTGGC 
18181 GTTTCTGGTC ACGGGCGTGG CACCGGAGCC CCTATTCGTC GTCACCCCGG CACTCTTGCT 
18241 TTCCGGTTGC ACAACCCACC TGCGCCTATT CAACCCCAAC GGTACCCCCA CGACTATAAA 
18301 AAGAGACACC CTTGTGGCCG CCGCCGCGCC CTGCCCCGTG GTGCGATTAA GCTCCGCCGA 
18361 CGACGCGCCG CGAGACCTCG TCGCGTCACC AGACACCGGG GCGCTCTCCA TTAACGCGTT 
18421 CACAATCCCG GTCGGTTTCC CAGGGGTGGT CTCGGCGGAG TGTCACGTGT CCCTACGCGA 
184 81 CAACGGGGTC CACGAACGCA TGAACCATTG ACGGCAACGA TGGGAACACC GGTGCGTTTC 
18541 TTTCGCGGCG AGTGGCAGAC CTCGAGTCTA GTGGACAACG GCACGCCACG GTACAGCTCC 
18601 CTGGTGTGGG CCGCCACTAT TCACGACGGC TACCTGACAC TGGTGAACAG GTCAGAGCTG 
18661 TGCGTCACGG AGAGGTCTCC GTGTCTGCCG GCATGCCCCA GCATCGGGAG ACTGGTCGGG 
18721 AAGAGGTTTC CCGGCTTCGC CTTTGCCAGC GCCACTCTGG GCGATCGGGG AACACGCACC 
18781 GTGTTCTACG CGTTCGGTCA CCGCGACAAC CCACTGGACA TAGTACCCGC CGTGGTCGAG 
18841 CGCGCGGATC GCGAGCTGGT GCTGCGGGTT CACGCTCCGC AAACAACGCG GGTGTCGCGA 
18901 TACGGACTTA AGGTATTCGT GGCGATCGTT ACGGTGGTGC GCCCGCCCGG GGTGTTCCTA 
18961 CACTTTCCAC AAGACCGCGT TCCGATCGCG CTGACAGACG CGTGCAGCCA GGAGGGCTCC 
19021 AGGCTAACCT CTGAAGAGCC GTGGATAAAA ATTCAAGGCT TTCCCGTCCT ATCTGACGAG 
19081 ACCGCGCACC CATTTCTCTT AACCCAGAAG ACCAAGCCCT TTACCGAGCG AAAGTTTTGC 
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19141 CGCCTGATCA TGGACAACGA CCAGCGCAGC GCCGTCAACA CCGTCTACCT GGGAAAGCAG 
19201 CACGTGAGGG TGACCGTGAC CCGCCCCCCG GAAACAATCG TCACCGACGG CCCCGTGACG 
19261 GCGACCCTGT CCCTCACCGG TAATGCGCCA ATCGCCTTTC GCCACAACCC ATACTTTGAA 
19321 CTCCCGTGGT CGTCCACAAC GGCGATATTC ACGCCCGTGG TGTACGTGGG CCTGACCGTG 
19381 TGCATCCCAC CCAACTGTAG CAAATTCGTA AGGTACGGTA ACACCTACGT CTCGGCATTT 
19441 AACCGCAAGC TGACGGCGAT TATTAGCAAT CACGCCCACA ACGGCGGGTT CCGGATTCAG 
19501 GACTGCGAGT GGCCACCGAA CCGGGAGATA GAGATTTTGG TAACCAACGT GTCCCAGGCC 
19561 CCGGTGTACA TCAGCACCGG GACGCAGCTG GGGCAAGCCA TCTTCGTGTT CGCGCCGCGG 
19621 TTCGGTGGCC CGGCGAAACT GCGGCAGCTC CTCGGCCACC GATCGCGCGC CCTGGAGCTG 
19681 CCGGGCGGGG TGACAGTGGA CAGCCAAAAA CTGTGTAGGT TTGAGACCAT GTACCTGTTT 
19741 TCCACGTAAA TTACTAATAA ACCGTTTGCT CGTATCGCTC ACACAACGCC AAACCGTCTC 
198 01 TCATTCTCGG GGTCGCGCGC CTCGCGAACA CACAAGGTGG CTCAAACACC CCCCTCCCGC 
19861 ACCCTCGCCA CACAAAACCA GTTAACGCCT TCCGTTAGAT GCAGTTTATT TATTATTTTA 
19921 TTACATCATA GCTATTGCGC GGCGCCCGTC CCGCAAAAAC ATCTGTAGAT ATTCCAGTAT 
19981 GCGAAACGCG CTGAGAACAA CGTCCCGGGG GCTCTGCCCT CCCAACGCAC GCACGGTTTT 
2 0 041 TTCATTAGAC TCCGCGCCAC CTATCTTGTT ATTTACGGGA AGCTCCTCAA TTAGAGAGTC 
2 0101 GAGGGCGGAC AGCACCACGG TCACCGCGGA GCCAATGGCC GCGGCGTCCG CGTCCCGGGG 
2 0161 CGCGTCGTCC AGCAGCATCC TCAGTCGCGT CAGGTAACTC TCGTAGGCCC GGAGCCCGCG 
2 0221 AACCATGGCG TTCATACACT CGGCCCCGCG AAACTCGCGG CGCTGGCACT CAACGTTAGA 
2 0281 CATCAATATA GGGAAGTGAA CAAACGACAT GATGCGCGGG TACTCACGCA CCCGACAGAG 
2 0341 AGTGGAGTGG TGGCACAGGT AAACGAACCA GCGCTGTATG TTTAACCCCA CTTCAGACAG 
2 0401 GCGCCCCCTG GTCCTGTCAA GAGAGGCGCG GTTGCCGGCC CACTGGGGGA GAACGTTAAT 
2 0461 TCCAGCGGCA GTCGGGGGAG GCGCCAGCGT AGGGCTGGCC GCCCAACACG ACAGGTAAAA 
2 0521 CAAGACGAAC CAGACAGGGA ACATGACCGG TTAAATTACT CGCTTACAAT CGCGGGCGGC 
2 0581 GGCCGGTCAA CGCCAGGTCC ATTAAAAACA CACCGGCCGC CAACCCCCAA CGCGGGGCCG 
2 0641 CGCCCTGGAA CGCGGTTCCT TCCAATCGCA AAGAACCGCG TCACAAAAAG GGCTCGTTTT 
2 0701 GAACCCATTT TGTGCCATCG GGTTTCGTTT TCAGATACGG AAACGGCTCG TCCAAAAACA 
2 0761 CCCAACGGGG GTGTGCTCAA ACGGCCATCT CCATCTCTAT GTGGGG ATGG GGGTCGTAGC 
2 0821 CCTCGAGACT CAGATCCGCG CGCGTAAAGT CCTCCAGACG CGCCACCTTT CTCAAAATCT 
2 0 881 TCAGCCGCGG AAACGGACGC GGGGTCCTCC GCAGCTGAAG CAGCAGGGGA TCAACGTGGT 
2 0 941 TGTTGTAAAC GTGGGCGTCA CCCAAGGTGT GCACAAAGTC TCCCGGGGTC AGGCCCGTGA 
21001 CGTGAGCGAT CAGATACGTC AGGAGGGCGT AGCTGGCGAT GTTAAACGGG ACCCCGAGGC 
21061 CCATGTCGGC GGACCTCTGG TACAGCTGGC AGGACAGCTC CCCCCGAGCC ACGTAAAACT 
21121 GACACAAAAC GTGACAAGGA GGGAGAGCCA TCCGCGCGAG GTCCGCGGGG TTCCACGCGC 
21181 ACATAACGAT GCGCCGATCG TGGGGCCGCC TGTTAATTAG ATCCACCACG TAACGCAGCT 
21241 GGTCCACCCC CTGACCCTCG TAGTTGGCGT CGGCCCCCCT GTACTCCGCC CCAAAATGTC 
213 01 TCCACTGGAA CCCGTACACC GGCCCCAGAT CGCCCTCGCG GCGGTCCCCG AAGCCCTGCG 

213 61 CCGCCAAAAA GGCGCGGGAC CCGTGCGCGT CCCAAATTTT TACGCCGCGG CGCGACAGTT 

214 21 CGGTGGAGTC GGTGGAGCCC CTGATAAACC ACAGCAACTC CTCCACGACG CCCCTCCAAA 
214 81 ACACCCTTTT GGTGGTTAAC AGAGGAAACT CGTCCCTGAG GTTATATCGG GCCTGAAGCC 
21541 CGAACACGGA CCTGGTGCCC ACGCCCGTCC TGTCCTCCCT CTGCACGCCG TGTTTAATAA 
21601 TTAAATCCAG GTGCGCCAAG TACTGCAGCT CGCCGTGGTC GCCGCGGCAC GCGCAGCGGA 
21661 CGGCCGCGGC ACCTTCGGGT GCGGAAAAAA CCCGGAGTCG TGCGGCCGCG ATTCCAGCCA 
21721 CGCAAACGGG TATAATTTTT TTAACGTAGC AGATACCGAG ATGCACCAGG ACAATCATAA 
217 81 TTACAACCGA GGTGCTTTAC AACTATAACC GTAAACGGCT GAAGACGAAA CTTATTTTAA 
21841 AGGCAGCTTG GGCGGGGCCG ACCACAGGCG CGGCCCGGCT GTTAGCGAGT AATAAACCCA 
21901 CGTGGTCTCC CCAAGCGTCT CCCCGAGCGT GAGCGTCTCC CCAAACGCCA AAAACAACAC 
21961 AGATAATAAA ATAAATAACA TGTTTATTTT TTATAAACTT AACGCGCGAT TATGCTTCTT 
22021 AACAGGGGCC AATAGAAGCA ACGTGCGAGG CGGCATTCGC GCGCTCAAAG GCAACCGCAA 
22 081 CACCGGCGCT CGTAACAGCG TAAACAAACA ACTATTAGCC ATTCGTAACC GTAAAATCCA 
22141 ACCCTCTGCG TCCCGAGCCA TTCGCCCAGC TGCGTCTACG TCGAGGGTGT TTCTAAGTCG 
222 01 CATTTTGAGC CCTTGGCACG GCCGCGTCTG CGTTAATGGT GTTTCTAAAT CCCGCTGCCA 

222 61 AGGCCCTCTT CGGAATCGTC AATCAGGCTG CGCCGGGTTC TTCTGCCCTT TTTGGGACGC 
22321 TTGTCCACCG CCTGAAGCAG TTTCTTCACT GCGTCGTCCC CGGGATTGGC ACACAGCTTT 

223 81 TTACCGCGGC GAGTGTTTAA TATCACGGCG TCCACCGAGC ACTGCGACGA GGTGTGGGAG 
2 2441 TAAGAGACCA CTAAACCGGG TGGCGGCAGA TGAGTTACAT ACCCCAAACA GCAGAGTTCG 
-52 501 GGCGCGGGTC CGCTCATGGA GCCCATAGGA AAGGCATAAT CAACTACACA CGCGAACACC 
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22561 GCAAAAAAAA CGCACACGAA AAGGCCCCTC ATGGTGACAA AAACACAAAA CAAAGACGGG 
22621 GTCGCTCGTG CGTTGCCACC AGCGCCACGC TCGCCGTGCT CGGCCACGGT TGACGGCGTC 

22 681 ^ATTAATACC CGCGGCCCGG GCCCGCCTCG ACGGAAACCA GGCGTGGTTA GCCACCTGAC 
22741 GCACGTGCCT CCCGTCCAGA TGTGGCATTG CGAGTCCCAA AACGGGCGTG TCTAACTACG 
22801 GTCGCCCCGG GGTTCAAAGA ACGGGGATCC CCCAAACAGG TAAAAAGCTT TTTGTCCAAA 
22861 CAGAAAGCCA ACAGCAACCG AAAGGAATCT CCAAAACAAG CAAGGGTCCC GTGCCGGCGA 
22921 TAAACGGAAA ATTTCCCTCA CCTAAACACA CGCGCCGCGT AAACTGTTAA AACACGCAAT 
22981 ACTTCCTAGG CCTGTGTTTA ATAAAACACA CAAGGGTTGT AATTTCGCGG GGCCCTTTGG 
23041 GCCCCGCGCG TCTGGGGGTG CGTTGTCTCG TTGTGGTTCG CAATCACCAC CCCGCCTGCC 
23101 CGAAAACAGG GCAACAACCC CCTGGCCTAG TTTTTTAAAA ACTTAACACC GGCAAGGGGA 
23161 GAGGGAGAAG GGGTGCGGCT AAATGGGCTG TTAGCAGCCA ACCGCAGCGG GGAAAAAAGG 
2 32 21 GGGCAAACGC CGGGTTAACA CAAACAGCAA CAGGCGCCAA ACCCAATATA ACACAACGCA 
23281 TATTCCGTGT GTTTCGCGGG CCGGGGTCGG TACGGCTAAC CCGCGTTGGG CGTTTGCACA 
23341 AGCACGCCCC CGCCCTGCAC CCGGCTCCCC GAGGTCACCG AAGGGTCAGG AAACATAGTT 
23401 TTCACGGTCG CCAGTCGCGC GCGTGGCAAC ATCTTTCGGT GCCCTCCGAA CGACCCGGAA 
23461 AAATCCCCGC CGCGCCGGGT CGTTCGGAGG GCACCGAAAG ATATCCCCAA ACGCAACCTA 
2 3521 AAGCATCATG TTTGGGGTTT CGGTGACGCG CGCCGCAGAG GAGGCCGGTG GTGGCGCTGG 
23581 CGAAGATAGT GGCCGCAGAG CACGGGTTTC GGTAGTGACG CGCCTGGACG GCTGCCACCT 
23641 GGCAGCAGGC CAAAAACACG GGCGCCGCGA GCGGCCGGCA GGTGGCGCCG CGCTTTCCCC 

23 7 01 CACGCCGCGA TTTGGCAAAA TGGCGGACCG GCAGCGCCGG TTGGACGCGG CGGACGTCAA 
237 61 TCACGACGGA GAGAAGCGCA GTTGGTCAGT TGTCCCGTCA ATCACCGAGG GCCGCGCCGG 
23821 TTGGCGGATA ATATGAAACC GGCGCGGCCA TTGGACGCTG GCGGCAGCCA ATGGGCGCGT 
23881 TGGGTTTTGT TTACAAGTTC CCTATATATA TTATATTATA CCTTCCCTGT TAAGGGAATC 
23941 CCATGTTCTT AAATCTAATC AGCTGGGTTA TATAATTAGA ATATCAGTAA TTACTATTAT 

24 001 TACTGTATTA TTTATTAAAT TATTTATTTT ATTATTTATT TAATAAATTA ATTCTTGCGT 
24061 TATTATGTAA TCAGATACTG GGAAATCTAC TCCCAGTTTA ATAAAAATAA ATAATTAATT 
24121 CTATTATGGT GGGGGTTCTA AATATTGTAC AAAACAACAA TATTTTTTAA ATTTATTCAC 
24181 TAAAAATGCT CCCATTTTAC TTGTGACGTA CATGTTAGGA ATGTGGGAGT GGTTTGTAAC 
24241 TATGTTTTCA AACAACACCC TTTTGGGCCG TCTGAAACTC ATATAAGCGG AAGGCTCGGC 
24301 GTTAATTGCC ACTGGCCGCT AACCAGTCCA CCTTGCCAGT TGGAGTTTTA TTGCTGCTTA 
24361 TGGCTTTACC TCGCATTCCG GGCACAAGCT GATTTACGTA GCACGCATTG GGGGTTTAAC 
24421 CCGGTTGAAT TTAGTGGTTT GCACGTGTAA AAAAGCCCAC CGAGGCTTGG GTTTAAGCGT 
24481 TTAAAGCGTT GCTGGTTTTA TAACATGCGA TTTAATTGCT ATGGGAGCTA GCTTGTATTC 
24541 CGCATGCTTT CGGTTTAGTA GTTGTACAAA ACAGAATTGT AGCCCGCAAA TTGTTACTGG 
24601 TACCCAAAAC GGCAAGCCCG CTTAACACGC CCCGCGCTGC GGCTTTTGTT CCTAGCCCGC 
24661 TCGCAATACA ATCGAGATTT TAATTTAAAA TGGGGCAATG GCGTAGCGCA AATTTAAGCG 
24721 CGTCAAAAAT TTAAAATGGA CTAACGTAAC CGGTGCTTTT AGTGACCTAT GGCGAATTTT 
2 4781 AAAATTTAAA GCCGTGGGTT TTAACACAGA GCTGCCAGCT TGTAATACGT TGCATATACA 
24841 GTGCCGTTGT GATTTTATGG TTAAGTGGAT TTTAACATTG AATTTTGTAA TGGTGTACAC 
24 901 GTGGATTTTT AAATTGAATT TAAAATTGTA ATTACGTGCG CGTGGATTTT AAAATTGTAT 
24961 TTATGTAACA TTGTTAAATT TTAACTGCGT TATGGTTGTA TTTTGGTTAA TTTATGTAAT 
25021 TTTGCTTTTA GATTTGTTCG GGTGTGAATT CCAGACAGGT AAGAGCCTAG CTCCTAATGT 
25081 TTGCCTTGCC GCCTAGCTCC TAATGTTTGC CTTGCCGCCT AGCTCCTAAT GTTTGCCTTG 
25141 CCGCCTAGCT CCTAATGTTT GCCTTGCCGC CTAGCTCCTA ATGTTTGCCT TGCCGCCTAG 

252 01 CTCCTAATGT TTGCCTTGCC GCCTAGCTCC TAATGTTTGC CTTGCCGCCT AGCTCCTAAT 
25261 GTTTGCCTTG CCGCCTAGCT CCTAATGTTT GCCTTGCCGC CTAGCTCCTA ATGTTTGCCT 
25321 TGCCGCCTAG CTCCTAATGT TTGCCTTGCC GCCTAGCTCC TAATGTTTAA CAACATTAAT 

253 81 GTTTAAGCAC ACTAAAATTT AAAGACGTTT GTGTTGGTTT TTATGACCAG CTTGGTACAA 
25441 AACCTGCTGG TGATTTTTTA CCCAACAAAT AATAAATAAA AAAGTTAAAA CTTATTTCTG 
25501 TTGTCTGTGG TAATTGGGTT CCCCGAGGGT CCCGGGCTCC CCGTTCCCCG AGGGTCCCGG 
zllei gScCCCGTT CCCCGAGGGT CCCGGGCTCC CCGTTCCCCG AGGGTCCCGG GCTCCCCGTT 
25621 CCCCGAGGGT CCCGGGCTCC CCGTTCCCCG AGGGTCCCGG GCTCCCCGTT CCCCGAGGGT 
"eS^ CCCGGGCTCC CCGTTCCCCG AGGGTCCCGG GCTCCCCGTT CCCCGAGGGT CCCGGGCTCC 
Isill CCGTTCCCCG AGGGTCCCGG GCTCCCCGTT CCCCGAGGGT CCCGGGCTCC CCGTTCCCCG 
25801 AGGGTCCCGG GCTCCCCGTT CCCCGAGGGT CCCGGGCTCC CCGTTCCCCG AGGGTCCCGG 
Will Sct?CCCGTT CCCCGAGGGT CCCGGGCTCC CCGTTCCCCG AGGGTCCCGG GCTCCCCGTT 
2^2^ cScGAGGGT CCCGGGCTCC CCGTTCCCCG AGGGTCCCGG GCTCCCCGTT CCCCGAGGGT 
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25 981 CCCGGGCTCC CCGTTCCCCG AGGGTCCCGG 
2 6 041 CCGTTCCCCG AGGGTCCCGG GCTCCCCGTT 
2 6101 AGGGTCCCGG GCTCCCCGTT CCCCGAGGGT 
2 6161 GCTCCCCGTT CCCCGAGGGT CCCGGGCTCC 
2 6221 CCCCGAGGGT CCCGGGCTCC CCGTTCCCCG 
2 6281 CCCGGGCTCC CCGTTCCCCG AGGGTCCCGG 
2 6341 CCGTTCCCCG AGGGTCCCGG GCTCCCCGTT 
2 6401 AGGGTCCCGG GCTCCCCGTT CCCCGAGGGT 
2 6461 GCTCCCCGTT CCCCGAGGGT CCCGGGCTCC 
2 6521 CCCCGATTCA ACGTTAAAGT GGTTTATTTA 
2 6581 TTGTACATAC ACGCGGGTGC GCATTGGCCC 
2 6641 GAAACGCGTA GGTGCTTTAG TGTTTCATGG 
26701 GTTTTTTGGT AGCGAATGCA ACCAGGTAGC 
26761 CCGCAGTGTT ACCGCGGTGC GGCATACGGT 

26 821 TACCTTAAGT TTACGTGTAG CAATTATGGC 
2 6 881 GGAAGAAAAT GAAAATTCTT TGCCAGTTGA 
2 6 941 CTGTGGGCTC GGGCAGGCGG AGTACTTGCA 
27001 TATCAGCGCC TTTCTCAAGG ACAGCGCTCG 
2 7061 TTACCGCTCT CTGCGGGGAC TAAACGCCAT 
2 7121 CGGACGCCAC AACTGGGGTC GCGCTTTGGC 
27181 TAAGGTTAAA GACGACGAGC GTCTTTTAAC 
27241 GTACGAGGCG CTGGAGTCTC AGTGGTTTCG 
27301 TTACTGCGAG CGAATACTGA GGCATCGCCG 
27361 TGCGGCTGGT CTTCTGGCGC TAGTGGCGCT 
27421 ACCGCGTATT CCCCCACCCT GACCTTAAGT 
27481 AAACAAGCTA ACGTGGTATT CGTTGGAACA 
2 7541 TTTTTTGTAT CTGTGACACG GGGGCCGGCT 
27601 ACTGTCCGCA TTCTGGATCG AGGGGCGGGC 

27 661 GAATGGCGTT TGACATTACA GATTCTTGGG 
27721 CGAGACCGGC GTGGTGGCTC GCCGGCTGCG 
27781 GCGCGCTCTG GTACGGCTGT GGGGCGTGGA 
27841 GCTGGTACGC TAGGTACGGC GCAGACGCGT 
27 901 GTGGTTGCGG TCGTTGGTCG GCGTCCTGTG 
27961 GGTCCCTCAG ATCCGCCTGA ATGTCCAATA 
28021 GTATGCCGAC CTCCTCGCCT GGAAACACGG 
28081 GCTTCTGGTG GCGAGACGGT GCCGGAGCGT 
2 8141 CATACGGAGA GGGTTGGGGG GCAAACGCCG 
2 8201 GTGCGTACGA TGGGACGGTC AGCTCAGGCG 
2 8261 CCGGCATCGA GGGTGGCGCA GAAACGTACG 
2 8321 GACTGCTCTG CACCATGGTC AGAAAGGTGC 
2 8381 GGGCGCTCAT GGTTCGGGTG TCGCGGTTAC 
2 8441 CCTTTAAATA CGTACTTCTC GCGACTCTGG 
2 8501 TGCGATCTCG GATAAATCCG GCGTCGATCG 
2 8561 CGAATTCTGG CGGTGTCACC TGACGTTCGC 
2 8621 CGGCTTCCTC TCGCGTAAGA GAATCAAACT 
28681 AAACGGCGAT TGACCCCCTA CGCCTCCCCA 
2 8741 GTGGGTGATT GGGGTCTTGC AGCTCTTCTG 
2 8 801 GCCATGTGTG TAACATCTCC AACAGGGGGT 
2 8861 CCGCTCGCGA GTCCCCCGCG AGCCGTGACG 
2 8921 TGCCCAGGCA AAATATCCCG TGTGTTACTT 
2 8981 CTTCGGGGAC GTGCTCTATG TTTATCGGTA 
29041 CGATTGAGGG CTCTAAATAC AGCTCCTTTT 
29101 ATCCCCCAAC GTACACGGGA GTCATGTTGG 
2 9161 GCGCCGGTCT TCGGGGGCTG ATGTGGCGCG 
2 9221 TCCCTGGCGA GCTCAGGTTT CTTCACCTGG 
2 9281 ACGTGTACCT TCTGAAGGAG GCAATTGCCA 
2 9341 GGCGTAAGGT TCCGGTGGAG TTCTGG7VAAA 



GCTCCCCGTT CCCCGAGGGT CCCGGGCTCC 
CCCCGAGGGT CCCGGGCTCC CCGTTCCCCG 
CCCGGGCTCC CCGTTCCCCG AGGGTCCCGG 
CCGTTCCCCG AGGGTCCCGG GCTCCCCGTT 
AGGGTCCCGG GCTCCCCGTT CCCCGAGGGT 
GCTCCCCGTT CCCCGAGGGT CCCGGGCTCC 
CCCCGAGGGT CCCGGGCTCC CCGTTCCCCG 
CCCGGGCTCC CCGTTCCCCG AGGGTCCCGG 
CCGTTCCCCG AGGGTCCCGG GCTCCCCGTT 
GCAAACACAC GTTTTTTGCC GCGGTTTGGG 
GCCCATGCAA AAAAATAACG TTGGGTTTCC 
CACGTTTTCG TGTCACGGTA GGAGTGGTTA 
TCGCCTGACC GGGAAAGGGA ACGTTTTAAA 
TTAACGTGCA ACTGCCGGTC CCGCGCGTGG 
GGCTGTTCAG GGCCCTCCGC CGCCCCCTGA 
CGTTTATGCT ATAGAGGGCA TCTTTCTTTA 
CCATCCCGTG TTTAGCCCTA TTAAGGAATT 
CCTGTACGAG AGGCTCTTGC GCCACACCGA 
AGGCCAAGGG ATGCTGCAAA TAAACACGGA 
CGTGTTGGGT CTTGGTGCGT ATGTGGTGGA 
GTTCGCCATA GCCGTTCTAC CCGTGTACGC 
TTCACACGGC GAATGGGAGG GACTCAGGAA 
CAACGCGAGG AGACACATGT GCTACGGAGT 
GTTTGCCATC AGGCGATAGG TGTGCGTTTA 
GAGCCGATTG TATATGAGAC CAATAAAACA 
TTTTTTATTT GAGCAGTTCC TCGCAGAACA 
GTGCGCTGGC CTCCACCGGG GCCCGCGCGG 
ACGCGCCAGC GCGGGGGGCA GAGGCTGAAG 
CGGCGGGGGC TGGAGTTGCC GGCTGCGTGG 
GGGGCGGTGG TTGCGTTTGC TGGATGCCCT 
GATGGGTGTC CGTTCCGGAA TACCACTGCG 
AGGTCTGTGG CCACGAGAAC TGAACGGGCG 
CGCCGGAGGT CTGCGACGCG GCTCGTTTTA 
TGTTTTTGGA CAGCGCCATG ACGTCCTTGT 
GCTCGTCGGA AAGCTCCACG TCCCGTTTGC 
AGCCGTACGT CGGAGAATAG TACGTGGCCG 
ACTGGAAAGG GAACGGTGGC GCGACGGGCG 
GCGGTATCCA CGATGGATAC GCCGCCGTTG 
TGCGGTGGCC CTGGTTGCGC ATGTTATCGA 
TCTTGGGAAT GGTGATGTTG TCCTCCGGGA 
CGCAGTGCTG AGCGCAGGGA AACTGGCTTG 
CCACGCCTTT ATCTGTTTTG AGCAGGTCTG 
CCTTGGCCAT CAGTGTCTCA AGAGGGGCCG 
AAAGATCTAA ACAATTGACC GTTATTTTAC 
TGGAGACGAC CCATGTAGGG TCTGGTCCGT 
GTGCGCACAG AGAGACGTGC TGAAACGCGG 
GGTGGAGGGA CGAGAGGGAG AGCTCCGGGA 
CCCGAGGCAT GGGGTTTAGC TGGATCTGGG 
CCAGGGCCAA GAAATCGTGG CTCGTTAGCT 
GGAAGAGACC GATTGTGTGA CCTACGTGGG 
GAGGGTCCGT ATACGGGAGC AGGGTCGCCA 
CTATCTTTGG TAGGCTGACC ACGTCCACGT 
GTCGGTGTCT GCGTGAGCCG AAGGAGATGA 
TTATCCGTTG TGAGAATTTA AATACGTTTC 
TGCTGTGCGA AATGTACAAC TACGGTCTTA 
ATACCGGAAC GAGGGACGAT ATTGTCCTCG 
TCGTTTACGA TGGGTTAAAG GAGATGGGGG 
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2 9401 TTTCTGACGC AACGTTGCTC TCAGAAACCA AAAGGGGGGC TCTGTGGCTT TACTTCAACG 
2 9461 GGCGTCCGTG CCTGTTAAAA GGTCTAGGCG ACTACGTTTT TTGCCAACTG GGGTTAAGCC 
2 9521 ACAGCGTACG CGTGGTTCCG GAAAACCTAA CGGATGGGAA TTATCTCTAT AACCTAGGCA 
29581 GCGTCATACC ATGTCGCCTG CTTGTGGCAC TGAGCTACTG TTTGGCGTTT TGGGGGCACG 
29641 CTGACCACGA GCCCTGGGTC AGGTTGTTCG CCGGCAAGAT TTTTATACTG TACTTAATAA 
2 9701 TTTCCGGCCA CATTATGCCA CGGAAGTCAA TCTTAGAACA GGTGGGAACG TCCGGGTACG 
2 9761 GTGGCTTCGT CGAGGCAGTG TGTCGCGATG TTCGCGCGGT ACACGGGATT CCGGCGTGGG 
2 9821 ATTTTGCGTC CGCGGCTCCG GCGTTAACGT CTCAACAGAC GGACTACTTG TTTGCGTTCA 
2 9881 ATAACAGCGT AGTGTAGGGT CGCCTCACAC TACCGACGTG GCGACCGGCA GGTATCCCAG 

2 9941 GACCAAAAAA TAAACTCTGT CGTAGTCGTC GTAAACTCCG AATTGGCTTT TCATTATCTC 

3 0001 GTGTGCCAGA CCGGGGGGCG ATGGGTGCGA AAGGATGGCG GCCAGGCCTT CCTCGGCCGT 
30061 TAAACGTAGC GACGTTTGAG CCTGAATCAG GCTGCCAGCG TCCCTGAACG TGAGCTTTTG 
30121 GTTCAGGATG TCAAAGATTT CGTTAAATCT GGAACCGGTC AGGTTGACGA ACGGACGAGT 
30181 GTGTTTTAGA TGGGATATGG CCTTTGCCTC CAGCGCCGCT AGATACGCAC CCGGGAACAA 
30241 AAACGACGTG GGGGCGTCTG GCGCGCGCGA AAGAATTGGG GAGATGTAAT TTTCTGTCAG 
30301 GAAATCAAAA AGTTGTCCCT TTTTAAGGTA TTGTTTGGAC CCGGACGGGT CTTCTACCTT 
30361 GGAAAATCTC CCACCCAGGA GCGAGGCCAG GTTGAGGCGC CGGTGGGGCG ATCTGGGTGT 
30421 GATTTGGGCC AGCTCCAATA GCACGTAAAC CAAAACTAAA CAGCCTAGTG CCGAGAGGCC 
30481 GGCTCGGAGG TTCCATTGGT ACGCTTGAAA GGGAGGCACG TCTTCGCTGA ACAAAGTGGT 
30541 CGCAAGGATG TATACGAGAT CTGGTCCAGG TGGCTGACCG' GTACCGGGGT TCTTGGAACC 
3 0601 GGGTAGCAGT CCGCTCTGTG CCAACGTCGC GTATATCACG TGGTTTGTAA AGAAGTCCGC 
3 0661 GTTGTACTTT GTTTCTTTGT TTAGGGGCGC TAGAGAGGCG CGTTGCCTCG GATTCGAATA 
30721 TGTAAACTCG AAACCCAGGC CCTTCGTGTT TTGTTTTAGG TGAGAGACAA GCGTCGCCAG 
30781 CTTTGTGCCG AGGTTATCCC AGAGGCCGTT TCGAGTTGGC GGCAACGGGG TACCTGTCCG 
30841 GTGACAGTAG TACCCGTTGA GGAGGCACAG CGCGGCCTGC GTGTCCGATA CGTTGCGGAC 
30901 ATCCGCGTAA AAGTCTTGTG TGATCGAGGC GGCTGTTGAG ATTACGGCAG ACATTAGCTT 
3 0961 TAGAGACGTG GAGTTTTGAA AGTTTGTTGT GCCTCGCAAA TCTCTGGGGA ACACCCTGCG 
31021 TTGCATGGCG TTATCTGTGA GGCTACTGTA CCACGGCCCG TATTGTGGCA GCCATTGGTT 
310 81 TTGGTTCATG TACAGGCTTG GCAGGAGCTC CAGACGGAGA TCCTGGTTTA CCGTGAATCC 
31141 CGGGTCTCCG GGGGCTATTG TGACAACGAA CGCGTCTCCA TGTGTCTGCG GGGCGATTTT 

312 01 GGGTTGCGGG TTTAGTGATG TTTGGGCATT GGCGGTTTCC CTGCCCGGGG ACCGCAGATC 
31261 CCCGACCATG TTTTGAATTG TTTCTAGGTC GCGCATGACC ACTTCTGTTT CCCGTTGGTG 
31321 CGTTTGCAGC ACGTTGTCTA GTTCTAGCCG GAGCAGGTTT TTTCGCAGCG TGTGAACCCT 

313 81 TAGCGCCAGC TGTTGATTTC GCATCACAAA CGCGTTTCTT CTTTCTTCTA TCAGAGATGG 
31441 TTTGTGAATA AACAGAACGT TGTTTGGATG GGGGGCCCAA AAAGTTGCGG TAATTTGCCG 
31501 CGCTGGTTTT CGTGGGTATC TCATACAGCA TTTTTCTGAC GTGCGCATCT TCCCATTGGC 
31561 TAAAAAACCC GTTAAGCGCC GCCGACGTGG TGTTGATAAA TGTAGGCGGA AACGCTGATG 
31621 ACTCAGTATG AATTGTTTTA AGTGTTTTCT GGTTCTTAAA TAGCAGCCAG GGCCTTACCG 
31681 ACCAGTTTTC GTGTCCTCTT GGCGCGGCTC TGCCTATATA TGCCACGGAA TCTGTCAGTT 
31741 GGTGTAAGCC CTGCCATCGC TGTGATATTT TTATTGCGTT GCCGGGAAAT ACAGCCGCGC 
31801 TTAGGCACGT CTTGAGTTCA ATTACGTAGC ACGTTTTTTG ATGTTCGCTA TTAACGGTAA 
31861 GCACACAATC TGCTATTCGC CTACCTAGGG AAACCTCGAA AAAAATCCTA AATTTAACGT 
31921 TTTTGGTTGC CCTGAGTGTT GGGCGAATTG AAAGAAACTT TAAAATATCA TCAAGCGACC 
31981 GGTGTTTGAT TAACCGTCGG TATACTGCTA AGTGGGCGCG AGTTCCAGCC CGTTTGCGTG 
32041 AGGCCGGAAG GGCTTCGAGG GTCGCGACGT GCTTACATTG ATTTGCAAAC GCCATGGCCG 
32101 AAGGAGGGTC TGGCTTTGGC GACGAGCTTG TCCGTCAGAT GCGGGATCGG AAGCCTAGAT 
32161 GGGACGAGTC TTCCGACGAT ACTGATGACG TTGACACCGA AAGCACCGAT CTTGAGTATG 
32221 ATGATGTGTT CCCCGTGGTA GATACGCACG GCTTAATGAG CCCTGGAAGT CAAAACTATG 
322 81 ACGTACCCAC GTCTCCGTCC GGAACGCCGT GGGAATTACT GCACCCGGAC GCCCTGTATG 
32341 CACATCCGAG GTGCCCGCCT AAAAGGGCGG TGGTGCCGGG CGGTGGGGCG CGTCCCAAGG 
324 01 TGTCCGCGTT CTCGGCTAGA CTTCAATATG TTGGACGACA GAGTTTTGGG GATAGAGAGA 
32461 CGCGGCAGCT TACCGGGGCT CAGTTTTCTT CCGAGAGCGA GCACGAATAT GCGGAAATAC 
32 521 CGGAGCGTAC TACTACACGC CCGGTTGAAA GCGGTGACAA GAGAAATTTC ACCTCCGGGC 
32581 GTAGGGGTGC AATCTCGGGA CCCTCGTCGA CAAAACCTAG TCACGGTGCG GGGTTAACAC 
32 641 GTAAGACTAA AACGTCGCTG AGTGTTAGTC TTAAAAACTT GCTGCGGATA AAGGACGATG 
3 2701 ATGCTAAGGT TGATGTACCC AGGCCGGTTA CCGTGCCGGT TCATCTCATG CAGCCGCATC 
32761 CTATGACGGA ATATAGAAAT GCCTTTTTAA TCTATCTAGA GGGGGTCATG GGTGTTGGAA 
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32 821 AAACAACGCT ATTGAATTCA ATGACTGGCA 
32 881 CCGAGCCCAT GAAATTTTGG ACGTGTGTTT 

32 941 TAGTTAAGCA AGGCACCCAC GGGAAATTGA 
3 3 001 GCAAGTTTGC GCTACCGTTT CGTGCGACAG 
3 3 061 GGCTGGTGGG AAACGGTAGC ACAAAGCCAG 
33121 TGTCCGCAAC GGTGGTTTTT CCGCTGGTTC 
33181 ATCTGTTTCA AATCCTATCT CTTTTCTCGG 
3 3241 CGCTCAACAG TTCGGAGGCG CACAGGCGCA 
33301 GAATCACGCA AAACTACTTG CGACAGGTAG 
33361 GGGTGATGAT GCAATATCTC ACACCGGAGC 
33421 CCATAGAGGA CATATGCAAT ATGAATTCCA 
334 81 AACTACATGA AC7VGAGTATG ATACCCATGG 
33541 ACGTGACCTT AATGGAGGTC TGTTTGGGAC 
33601 TAATTGTTGA CGCAGGAGAA CATCTAGATG 
33661 GGCAGGTGAT GTCAAATGAG GCTATTAAAC 
33721 GCTACATTCA AACGCTAACC AAATTGGAAG 
33781 TTTACTATAA TAAGGTGTTC TGTAACGGAC 

33 841 GAGTTGGAAT TTAATGGAAC AATTTACCAA 

33 901 ACATCTATTG TAATGGAAGA TGCGTGGTAC 
3 3 961 ACTTTGGAAA AAAGAAAAAG TTTGCTGCGA 

34 021 TATACATTTT GTAAATCTTC AAGTGACCAT 
34 081 AGTTCTTTGC CAGGTTTTAC TGGGAATTTT 
34141 GTTTTATTAA CGAAACGTGA ATTATTTACC 
342 01 GCGGAAAAAA TAAATGCAGA AATGTTCAAG 

342 61 TCCGGCATTA TAACAGAAAA TTGGATATTA 
34 321 AATATGCAAT GTGTTGCACT GTTGTTTGGT 

343 81 TATGTATCTT ATCGTGATTT ATTGGTCGTT 
34441 GCGCCGAAAT CTTATAATAC TCTTGATTTG 
34501 TTTGTTTCAG TTATAGATAG TCCGCTTAAT 
34 561 GCTATTGAAG CAAAGGGTGC CTGTCAAAAT 
34 621 TTTTTTGAAG TTACTGCAGT T^AATTTTTTA 
34681 GTTAACGTTG, GGTGTGTTGT ACGGCATGTT 
34741 AGGCTCTGCT ACCCAACGTT TAAGTTATAT 
34 8 01 GCAGAGTCCC AGGTTTTTAA CTTGCCGGCA 
34 861 CAAGAGGTCG TATTTTCAAT GTTTAAAATT 
34 921 ATTCTAAACG AAATTGTATA TATTACTAAT 

34 981 CAGCTAACCA ACACATTTAG GCGGAATGTA 
35041 AAATTAAACG TCACAGATTC TTCTGTGTTT 
35101 AATAATGTAG AGATATCATA TATGATTAAT 
35161 TTTCGCGTGT TTTCGCCTTG TTTTTTAAGC 
352 21 CGATCCGACG CACCGCAGAC GTCAAAAAGA 
35281 GGTTTTTGGC GGCTGTTGCA CGCTTTTCAC 
35341 AATTGTACCA GGCTGGCATG GAAGCAAGTA 
3 5401 TACGTCATAA GCTCCGTGCG CCCTGATCAC 
35461 CTTAACAGCG CGATGTTTGT CTCGGCCGTT 
3 5521 GGTACCGCCC TACACATTCC GATATTGTAT 
3 5581 CTGTGTGATT CGATTGTTTT GAGCTATGAT 
35641 GTGTCAAACC CCACAGTTCA AGCCAACCTG 
3 5701 GATAATTTTC ATATCCATTA TTTGTGGTTA 
3 5761 GGATTGTACA GAAGACACGC ACTCAGTGCT 

35 821 ATGTCTGCTC TGTATTTTCT GTTTAAGCTG 
3 5881 GCGTAAAACT TTAAAAGGTT GTTTCCCGTT 
3 5 941 TTAGCGAGAA CCTCGGATAG AGGCGTGCCG 
360 01 ACTTGAATGG CCAGCTCTCG ATTTTGGGGA 
3 6061 GTTAATAATC GCTGCGGAGC CCGCGACAAA 
3 6121 TCTATGGAGT TTAAACAAAA CACCATGTCT 
3 6181 GATTCGAAGT ATTCTCTCTG ATACGGGCCC 



u '^M O ^ li 0 1LJ Q tn Oi 5 iij b:^ o 




PCT/US99/26260 



TGGTGCCGCA GGAAAACGTC TTAAGCTGTC 
ATTCAAATTG CCTTAAAGAA CAGCGCAGCA 
TCACTTCTGC TCGCGTATAC GCGTGTCAGA 
CCGCCGGCAT CGGTCGCAAC CTGCAACCGT 
CGAATTGGAT TGTTTTTGAC AGACACCTAC 
ACGTAAAGTA CAACAGGCTA ACGCCGGATC 
CACACGACGG CGATGTGGTC GTTTTGCTAA 
TTCAAAGTCG AGGCCGTAAG GAAGAAAAAG 
CGTGGGCGTA CCATGCCGTG TTCTGTACGT 
AAATGGTTCA GCTGTGTGTA CAAACTGTGT 
GATTGACTCA TCGGTTTCTA ACCCTAACAA 
TGGCAGAAAT GTTAGTATCG GTTAAAGAAC 
TCTTTAAAGA GCTACGAAAG CTTCAAATTT 
ATGCGTGTGG CCTCTGGGGA AATATTTATG 
CACGAGCAGT GAACTGGCCA GCTCTTGAAA 
GCAATGGCGC GTATTAGTTT TATATTTTTT 
AAATACGTGT ATGATGAAAA ATCAAATGTA 
ATTAATTGGA GAAACGTAAG CAAAGAATTA 
GATTCTTTAC TGCTGGAACC ATTGTCTGTG 
TCAAGTATAG TTAATGTGCA TAATAATGAC 
GTAATTAACC TTACGGTGGA TTTTAATTAC 
AATGTAATGA CACATGCTTT AACACAAGGC 
AACTCAACAA ACATTATGGA TCTTTTTTAC 
ATTACTTTTG ATTATTCCT^ TGTAATAATT 
GTATCAGTTA CAAATTCAAG TGTAAAATCA 
GTTCCTAGCA CATTTCCAGC GTTAAAAGGC 
AAAAATTCAA ACTATGCTCT GGGTGTAATT 
GCATTTTTAC CAAAAAATTT TACAGAAATG 
GCCATTGATT ATCTCAAAGG AAAACTTTTG 
CCTTCAAATG AAAATGATAT CTTATCTTTC 
tttataaa;^ ATCTTCAAAA GCAGCAATTA 
GCAGCGTTAG AAAGTTTAAT GCATTTATTA 
GAGCTAAATT TGGAAACGTT GTCGCATATT 
AATAGCATGT TATCGTTATC AGTAAATGAT 
GTATATAATA CACCTAAGGT TGGTGGTAAA 
TACATGTACA CCAAATACAG TGAAAATTAT 
ATGAACATGT ATGAAGTTTT AACAACAATT 
TATCCGTATA TATTGTTCAC ATCCATGTGT 
CAAATAGCAA AGCCCGATGA CATCACCATA 
TTAAGGTTTG ATTTAGATGA AAACAAGCTA 
ACCGGCTCGG AACTGGCACA AGGAGCGTCT 
GCCACGCGTA TAAACGAGTT TTCAGTCATA 
ACCGCGCTGA TGCCGCTTAC GAACATCACG 
GCTCGCGTTT ACGAGGTTTC GGAGGTGTTC 
TATCCAAATT GTTCTCATTT TACACCGCCC 
AACTTTTCCG CCCCGAGGAT AGGATGTCCG 
GAGAACCAGG GACTACAGAC CATGATGTAC 
TTTTCCCCGT ATTCTCCTTT CTTTGACAAC 
ATGAACAACG GTACCGTTGT GGAAATTCGG 
ATTGCATTAG TATTTGCTTT TATTGGAACA 
TTTTCCATCC TGGCCTAAAC TGTTAATAAA 
TCTTTTTGGA TGGTTACGCG GGGACTGAGA 
TCACATATTG ATTCAATCAC CTCTGCGCCA 
GGAAACATTA AAAAATATTT GTCGGTCGCG 
ACTCGAAAAT CCTTCTGAAA GTCGTGGGAC 
TCCAGGCGCT GTTTTACGCG CGAGAGCATG 
GCCGCCTTAC ACGTATTCGC GTATAGCATT 
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3 6241 TTAAAGTGAA ACAGAATATT TTCCAAACCC AAACGCTGCA ATAGGACGTG CTTTAAAAAA 
36301 AGGTGTTGGA TAGGCTTTAG TTTTAAATTA GGTAAAAACA TTATAGAGTT GTTGGCGCGC 
36361 TGCATAATTG TCACGGTGTT AAACGCGGTA ATCGGGCTCT CGCCTATTTG TGGCTGGCGC 
36421 GAACCCATGG CAAAAAATAT GTCTGCCAGT AAACAGTCCG ACTGTACATT TGACTCGCTG 
36481 GTCGCTGGCG GTTGCTTGAG GATGGATCGA AGTTTTTTAT TATCCGGCGG AAGCGCGCTA 
36541 CATAAGGTCG GTGGCGCATT TGGATCGTAT TCTTGCGTGG TGTTTGCCAG CTCGTTTATG 
36601 GTTATCTGTT TGTTATTTGT AAGAGTTACC GGTGGGGTCT TGACGGTCGC CGGGACGGAA 

36 661 AA^GGCCTGC AGTGCAAAAA AAGAGAGTTA CTCGCAATCA CTAATTTTGA GTTTACGGTA 
36721 ATT^GTGTGT CCTCAAAGTC AGAACTTCTA ATTACTGGGT CTCTGAACAA TGATACGCCT 
3 6781 CTCTCGCCTT CGTCCGTTCC CACGCACAAG CAGCTGTTTA GCGCGTAGGG ATGGGGTGAC 
3 6841 ACCATAATTC GCAGCATTGA TAATTCCGGA CGTTTTGGGG CGTACAATGG CATCATGTTT 
36901 GAATAGATGG TGCTGAATTT TTTAAACAAC AGGGGAAGGG GTACGGCCGG TCCCGGAAGT 
36961 CCGACCATTG CGGCAAGTGC CGACGAGTAC ACGACATGAC AGACGTACTC GCCGTTGGAT 
37021 AGCTGCACCG CCCCCCCAAC TAAGCGAGCC TTTAAATCTG AGATTTTTAT CATTGATTTT 
37081 CCTTTGGTAG TTTCAGGAAC CGCAGCGGCC TTGGCTTCGC GGTTATATTT TTGCTTTATG 
37141 TCTCCAGTGA AGTCCCAAAC GTCTAATAGT GTGTCGAATT ACCTGCGGTT TTAAGTTTGA 
3 7201 AAACGCGTGA CCGGTCGTAG CTATGAAAGA CATAAAATTG GCGTTGCGTC CTAACTCGTC 
37261 TAACCGTATT TTTTTACTTC GCACGTATAT CTTTGGAATG AGGTTGCATT TTTTATATCC 
37321 CAGAATAGCG CATTCGTGTT TGCTTTTGGT GTGACTAATG ATGCGTTTTG AAAAGTCAAA 
37381 CGTACCAAGG CGGGCGGCGC GGTTCAGGGC TCCCAGATCC AAACGCAAAA ACGAACAACA 
37441 TTCCACCAAA CT^CTCGATAA AACGCTTGTG TAGAGTTTGT ATTTGCGCTT TATTTCTTGT 
37501 CTCCAGCGTT GTGAATATTG TGTTTTTCAC CTGGTCTATT CTAGCATATG GAACGCTTAA 
37561 TATTGGGTTC AATTCATCCG AAAGTTTTTT AAGTAAGGGC CTACCGTATA TTTTAGAAAT 
37621 TGTCGCAGTT GCCGCCTTAA ACCAGGAAAC GTTGCATCCG TCTTTACTAG AAACCACAGA 
37681 AGGGAAATTA GTGGTCCAGA AAACGTCTCG CCGCCCCCGT CTAACTATGT ATTGGTTTTC 
37741 AAGCCCGTGG AGGTCTAACG TAGAGTTCCA GTTCGCAATC GAAACCGGGA AAATGTGCTT 
378 01 CACCGGTAAA AAGCACCTGA GGTTTCCCAT AGCGAACATA AAGTGCAGCA TGTCCCCGCT 

37 861 TATGTTAACG TTAATTGCAG CCCCGTTGCA CAGGTTGTCG GAGTAGAACA CACTCACGGT 
37 921 GAATGACGGT TCTTTCACGG AGAATTTTTT CATTACAAAG TTATTAGTGA GCCTGGGGAT 
37 981 GTCGATAATT GACTCGAAGG TACCGGCGAG AGACCAGCAC GTGATTTTGG TATTAATTAC 
38041 CATATTGGTA TTGAAAACTG AGATTTTGAA GTTGTGTAAA CTCATGTGCT TAACTGGATC 
3 8101 GAACATGTCT GGAAGAGGTT CGGTCTGGTG GTCGTGACGC GCTTGTTCAT ATCGATCTGG 
3 8161 GTTTTGATCG GCGTTAAAAA TAAAATTGTA ATCTCTATCT TTTGGTGCGT ATTGAATTCG 
3 8221 CGAAAACATG GCCTCTAGGA CGCGAGGCGT TTCGCCTGGC CCCAGAAACT CGGTGTTAGG 
38281 ATACAGCCTG TCGCACGAAT GGCACACGTG TTCCGCCACA GACATGCACA TTGCGACGCG 
3 8341 GAATATTGGT ACCTCGGGTG GAAACGTTAG ATAGTACGTT GACTTTGGCG TCACAACAAC 
3 8401 TACCTTGTAC GGTTTTTTGC CGGGATCCTT TTTGTTAATT TTCACCTGTA GATTTACGTC 
3 8461 AAGCGATTGA TTTACTATGT TTTGGTAGTC TATGATTAAT TCGCTCGCCA CTTTGATGTC 
3 8521 CTGAAATTTG TTTTGCTTTG AGAGGTTTTT TAAAAGTACC GGATTGGATT CGATGTTGTG 
3 8581 AATAATTACC GTTTGGACGC GATGCAGTTC GGGATGCGAA ATCCAATCCG AAGTAAAACC 
3 8641 CAAAATGTTA AATTTTAAGT AATTCAGATG CAGTTTTGCA CACGGGGGCG GACACCGAGC 
3 8701 TAGCGTGAGG ATTTTGAATG GCTCTAAAAG TTTAAAGTAA TTCTGATAAC ACGTGCCGAT 
3 8761 CAGATGTTCC GGCAGTGTGC GCGTAAAGGC AGCAAAGCAT TCCTCGAATT CCCATATTTG 
3 8 821 AAGGTAACGA CAGAGGATTC GAAACATGCT TTGATAATTT ACGCCACATA CCATTAATGG 
3 8 881 CGAACAGTAC AGCGACCGTA GGATTCGTTC GTCGCTTTGA ACCAAAACCG TGTCGGACAG 
3 8 941 TTCGTAGGTG AAAACGGTGG GTAAAAACGT ATGCGTCTCG ATATGGGAAA ATGTACATCC 
3 9001 GATAGCATCA TCAATTATTA AACCCGGTCC CGCGTACTCG TTACCGGTCT TGCCAACGAT 
39061 ACCACCTTGG CGGTGTATAA AAGGGTATAG TCCTTCTCCC AATACCAGAC GCCGGAGCGC 
3 9121 ACGGCACGCG TAAAAGGTGG TCTCTGTTTG CATAGCCCTG TCCAAGGCCT CTAGCGAGAC 
39181 GTTTTCAAGA ATCTCATTAT TGGTGGCCAG GCAGCAAAAA ATTGCCAGTT GAGTTTTAAA 
3 9241 ATCGGAATTG GCCGGTGACG GGGCGTTCGC TGCCACCGTA GCAGGACACG CGGGCAGAAG 
3 9301 7U\CAGGGCCC TGCAACAGCA TAATGGAGGC GGCTCTGGAG GTACGACCCT TTCCATATAT 
3 9361 GGCCACGGAG GCCAATCTGC TTCGTCAAAT GAAAGAGTCG GCAGCCAGCG GACTTTTTAA 
3 9421 AAGTTTTCAA CTGCTGTTGG GAAAAGACGC CAGGGAGGGA GGAGTTCAAT TTGAGGGTCT 
3 9481 TTTGGGCGTG TATACCAACG TAATTCAGTT TGTGAAGTTT CTGGAAACGT CGCTAGCGGT 
3 9541 TGCGTGCGTT AACACCGAGT TTAAGGACCT CAAACGAATG ACGGATGGAA AAATTCAGTT 
3 9601 TAAGGTATCT GTACCGACCA TCGCGTATGG GGACGGCAGG CGGCCCACAA AACAAAAACA 
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3 9661 ATACATTATC ATGAAGGCCT GCAATAAGCA 
3 9721 TGATGACATC GAGCTGCTAT TCATTGACAG 

3 9781 CGGGGCCGTA AAGACGATTA CCGCCTCTCT 
39841 CCTGGTAGAT ACCGTATTGA ATGTTAAGCT 
39901 AACACTATCA GACCCGGTCT ACACCGAACG 
39961 GGTGTCCATG TTCAAAAGCT ACCTTATGGA 

4 0021 CGCCGTCAAG GGAAAGCAGT ACGTGCTGTC 
4 0081 TCACGAAACG GTTTTTT^GG GGACGAATAC 
4 0141 CGGAGTCATG GAGACCACGG AAAACGTAAT 
4 0201 TGACGGGGGC ATGTCCGGTC CGGCGTCTTA 
4 0261 CGTAACCGCC GTGACGTACG GTCGCGTCAT 
4 0321 AGTGGACCGT CCCAACGCGC AGCCCAGCGT 
4 0381 GCAGGACTCC CTCGCCAAAA CACCGATCGC 
4 0441 GGTGGCTTTG GAAAGTTTAC AGCGAATGTA 
4 0501 CAGGCGTATG CACTATACTT attttttccc 
4 0561 TTCCACGTCG GCCACAATTA AGGGGGTCGA 

4 0621 GATTGTAAAT aaaaacaacg tactgctgag 
4 0681 ctgtcatccg cgcatgcaca acccgatgcc 

4 0741 AGATCCCGGA CACGTTCACA GGTACGGACA 

4 0801 ATACGGGCTG GTGTACAACT ATTACCAGGG 

4 0861 CCTAAAGGCG ACGATGACCA CAGATGAGTT 

4 0921 TTTGGAGGTT CACCCGATGT TTGATTTTTT 

4 0981 GTATAGGGCG ACCCACAGGA CTATGGTGGG 

41041 GTTTCAAAAC AGCAGAGGCC TGCAGTTTGA 

41101 CCAGTCJ^CG ATGGAAATTA TCCAAGATAC 

41161 TTATGTCATC GAATGCCTCA TTCACGGACA 

41221 AATTGCATTA ACCATTGAAA CTTACTGGAA 

412 81 CTTCCCTATG CTGCGATTTA TCTGCGTTCA 

41341 GTACGCCCAT TACCGAAAAG TTTTTGGCGA 

414 01 AATCGCGGGC CACGAGGTGG TGGGGCGCAG 

414 61 GGACCCCAAT CTTTTGCCGC CCTTTGCTTA 
41521 GTCCTCGCGG CACCCCATGG TACTCATAGG 

415 81 GGATACGTAC ATCAACGTCA GAGGAA7VAAT 
41641 TTACGAGACC AGAAACAACG CGGATCATGA 
417 01 TAATGAAAAC GAACAGCACA TGGCTGTGCT 
41761 AGCCTGTACC AACGGTCACG TCTGCGGCAT 
41821 CCTGACGTAC AACGGCCCAG TGTTTGCTGA 
41881 CCACCTGGAG AACGGGACGC TCCGCGAGAT 
41941 TGACATGATT CGAACTCTTT GCACGTCGTT 
420 01 CCGTGTTGTG ACTCAGCGGG ACCCCGCGCA 
4 2061 CGTGAGCCAG ACTGTCCTCG TTAACGGGTT 
42121 TGACGTTGCC GAGACCATGT TTTACCCGGT 
42181 GGTGGCGGCC ACGCTCCACC CGCTGGTCGC 
42241 CGTGCCGGTC GCGTTTAACG TCCCCCCGGC 
4 23 01 GTCTCCAATG CTGGCCTACG CTAACACCTG 
4 2361 GGCGAGCATG CACATG7VAGC TGTCCGCGCC 
4 2421 TCACCCGGGC TTTGCGATGA CCGCCGTCCG 
42481 ATTTAGTGCC AGGGCCTCGA CGTCCATGTT 
42541 AGTCAGGGCG GACGCAGTCA CGTTTGAGGT 
42601 GCTCGGTTAT TCTTCCACCA TCACGCCCGC 
42661 CGTTCACTGT CAGGACATGT TTCTCATGTT 
42721 CAACGACTAC GTTAAACAAA AAGCCGGATG 
4 2781 TGAGCCCGTC GCTTACGTTG CGGGGGTGCC 
42841 CGGACAGCTG GCCACGTGTG AGATTGTTTT 
4 2901 TCAAACCCCC AACAGTCCCC GGGGACGGGC 
4 2 961 CAACGAAAGC GCGGAACGTT TGCTCTTTGA 
4 3021 CCGCACTACG GTTAACCCAT GGGCGTCGCA 



TCACATTGGT GCCGAGATAG AGCTGTCGAC 
AGAAACCCCA CTCGATTACA CAGAATACGC 
CCAGTTTGGC GTGGACGCGC TGGAGAGGGG 
TAGGTCCGCC CCGCCGATGT TTATTCTAAA 
GGGTCTAAAG AAGGCTGTTA AGTCAGACAT 
TAACTCGTTT TTCCTCGACA AATCAGACAT 
GGTTCTCTCC GACATGGTGG GGGCGGTGTG 
GTATCTGTCT GCATCGGGAG AGCCAATTGC 
GCGAAAACTG TTAAACATGC TAGGTCAGGT 
CGCCAATTAC GTTGT-CAGGG GCGAAAATCT 
GCGAACGTTT GACCAGTTCA TGAAACGCAT 
CGACGATGAT CGGGACGCGG TGGCGGACGG 
GGCGGCTGTT ATCCAGATTG GAGATAAACT 
CAACGAGACC CAGTTTCCCT TCCCGTTAAA 
CATTGGGTTG CACATGCCCC GTCCTCAGTA 
GCACCCAGCG GAACAATCCG TCGAAACGTG 
TTTCAATTAT CAAAACGCGC TCAAGTCCAT 
GTGTGGGCAA GCTTTGGGTC AGGCGTTTCC 
AAGATCTGAG CACCCCCCGA ACATGAACCT 
CAAAAACGTG GCACACGTTC CGGATGTCGC 
GCTGCACCCA ACCTCGCACG AGACGCTTCG 
CGTTCATCAG CAGCCTGGTG CGCAAGCCGC 
TAACATTCCA CAACCCCTGG CGCCCAACGA 
CAGAGCGGCG GCCGTGGCTC ACGTGCTGGA 
GGCGTTTGAC ACGTCGTACC CACTACTCTG 
GGAAGACAAA TTTTTGATTA ATTCTCCTTT 
CAATGCCGGA AAACTGGCGT TTATTAACAG 
CCTGGGCAAC GGTAGTATTT CTAAGGACGT 
ACTCGTGGTT TTGCAGCAGG CGCTCTCGAA 
GCCCGCGTCC GAGCTGATTA ACTGTCTTCA 
CAATGACGTT TTTACCAACC TGCTCAGGCA 
CGACGAGGGG TACGAAACGG AAAATGACAG 
GGAGGACCTA GTCGGTGACA TGGTTAACAT 
CGGCCGCCAC GTCCTTGACG TCGGTCCCTT 
GGAAAAGCTT TTTTATTACG TGGTTCTGCC 
GGGCGTCGAT TTTGACAACG TGGCCCTGGC 
CGTCGTGAAC CCCGACGATG AGATTTTGGA 
GCTCGAGGCT TCGGATATAC ACCCCACCGT 
TCTCACCTGC CCGTTTGTTA CCCAGGCCTC 
ACTGTTGACC ACTCACGACG ACGGGAGATA 
CGCGGCGTTT GCTATCGCAG ATAGGTCTCG 
GCCGTTCACC AAGCTGTACA GCGATCCCCT 
AAATTACGTG ACGCGCCTGC CGGCCCAGCG 
CCTCATGGCC GAGTACGAGG AGTGGCACAA 
CCCGATGACG CCCACGTCGT TGAGCACCCT 
GGGGTTCATC TGCCACGCAA AGCACAAGAT 
AACCGATGAG GTGTTGGCGG AGAACTTGCT 
TTTAGGGCAG CCATCGGTTA TGCGTCGGGA 
GAATCATGAG TTGGCATCGC TGGACATGGC 
CCACGTTGCG GCGATTACCT CGGACATGGG 
TCCCGGGGAC TCGTACCAGG ACAGGACCCT 
CCAACGATTC GGTGGTCCTG GCCAGATTCG 
GCACTCGGAC AACATACCGG GTCTCAGCCA 
GACGCCCGTT ACTGCAGACG TTACCTATTT 
ATCCTGCGTG ATCTCGTGTG ACGCGTACAA 
CCACTCCATC CCGGATTCTG CCTACGAATA 
GCAGGGCTCC CTCGGAGACG TGCTGTACAA 
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4 30 81 CTCAACCTCG CGCCAGGTCG CAGTGCCAGG 
4 3141 CAAGGACGCT ATTTTGCGTA ACAATCGGGG 
4 3201 CCGCCTCACG GGAACGCCGG CGACCAGCGC 
4 32 61 AACGGATGTG TTTCTAGAAC AACCGTGCCA 
4 3321 CGCCAGTCAC AGGTCTCTGC TGGACGAATA 
433 81 GCACATGGGA CATTATATGA TTGAGGAAGT 
4 3441 AAACAAGGTC GCCTATTAGT TTAGCTCAGA 
4 3501 ATAAGAGCAT CGTTGTCTCG GTGACGTCTA 
43561 AGTCAAAGAT AGGATGCATT TTGCCTCTCA 
4 3 621 CGCTGGGTCT GGGGAACCTG TGCTCTAGGG 
436 81 ACTATTTGGA CAAATGCACT CTCGCCGTGT 
4 3 741 TAACGCGCAT TGATCCCATG GACAATTATC 
43 801 ATTGGGATAA CTACTCAGAA TTGGTAGTTA 

43 861 CCGTCTCACT GGAGTCTAAC GGGTTTGATG 
4 3 921 TGGCTCAAAC AGTGCTTCAG AAGCTGCTGC 
4 3 981 CGACGCCCAC CGACGTCAAC CTAGCCGAGG 

44 041 TGGGTCGCAA CTACGCCCTG GACGTGGACC 
44101 TGGACGACCT GTCCATTTAC CTGTGCGTTT 
44161 GGCTACTGAC CTCATTGGTG CGCCACAACA 
44 221 TGGTGCCACC TGAGGTACAG GCCCTGGATC 
442 81 CGCGCATGGG TGCCCTCATA ACCTATCTAC 
4 4341 GCAGATTTCA CGTTTACGCG TTCTCATCGG 
44401 ATAACTAGAA ACGGGCCTCC CTGTGCTTTG 

444 61 CCAGAGACAA CGAGGGTACG GTGTGTGAAG 
44521 TGATTTATTA CCCTAAGACC ACCAACTTAG 

445 81 CTTTTGATAC CGAAACCCCA GTGGACATTG 
44 641 TATCCATVAAA AAATTGCGTC CGTTTGGCGT 
44 7 01 GCTTTGCTCT GCTCCTCACT TGGCAGTTAA 
44761 TTGCGGTTAA CCCCATGGGT CCAAAACATG 
44 821 CGGTAGGTTG TGGGGCAGAC GCCATCTGTA 
44 8 81 AGCTTGGAAT GGTTTACCAC AACGGGTCTA 

44 941 TGGCCCTGTT AAAGGCGGCC TGGCTTGTCA 
450 01 GGCAAGGCGC TGCATTACTT GGTCCATGGA 

45 061 CAATGGCGGC AACCACGTTT TGCGGATCCA 
4 5121 AAAACCTCTG TCGCCCCACC TGGAATGATG 
45181 CACTCTGTCC AAATATACCT GTGGTGCCCG 
4 5241 GAGAACAATA AACGTATTCC CACGCACTTC 
4 5301 TATTAACAGT AATGGGAGGT TCCGCTGGTG 
4 5361 ATATTGTGAA TGGGGAAAAT AGTCTGTAT^ 
4 5421 CGAATGGGGT TTTAACCACG ACGGGCTTTT 
45481 CTCCATTTTA TCGCGTAATT ACAAAACCTC 
45541 TGTGCGTTAT CGCGTTGGTA TGGTACGTGA 
4 5601 TTGCCGATTC GTGTCGCGAC CCGCGTCAAC 
4 5661 GTCCCCACGG AACCAATCCA TAGAGAACTT 
45721 TCTCTAGCTC GTTTATTCTA GCGATTTAAA 
4 5781 GGCCATGACG GCGGCGACAA GCACGTCGTC 
45841 GTGACCTCCG TCTTTTAGTG GAATGCAACG 
45901 TACGGGATCA AACGATAGCT TAATAGTGTG 
45 961 TCCCGAGTTC AATGCGTAAA TAAAAGATTC 
4 6 021 GTAAATGGGC GAACGTATAA GAGTGTTCTT 
46081 CGAGCAAATC TCGTTTAACA CGGTTGCTAT 
4 6141 TTCCACGGCG ACGTTAACGT GCGTGATCTG 
4 62 01 CGCGGCGGCA CATGACGCTA TCTGGTACGC 
4 62 61 ATGTTCCACG CCTAATAAAA TGCATTTAAT 
4 6321 GATGCCGGTG CCGGACGCGT CTGTGTTGTT 
4 6381 ACACGGCTTC ATCGCGCCCT GTATATCTTG 
4 6441 TTGTGACAGC GAAGAGTCGC TAACGATTTT 



GATGTACAGT CCGTGTCGCC AGTTTTTCCA 
CCTGAACACA CTAGTCACGG AATACGCGGC 
GACGGACCTG CAGTACGTGG TGGTCAACGG 
GTTTCTACAA GAAGCGTTTC CCACGCTCGC 
TATGTCGAAT AAGCTCACGC ACGCCCCTGT 
GGCCCCTATG AAAAGACTAT TAAAGATCGG 
CGGTCTGGAG CTAACGAGAG ATGGCCCTCG 
GATTATTCGC CGACGAGATA GCAAATCTTC 
GAGACGCCCA CCGTCTGCAG AATATACAGG 
ATTCCGCGGT GGATTTTATT CAGGCATATC 
TGGAAGAGGT CGGTCCCAAC AGTTTACGGC 
AAATAAAAAA CGCGTACCAA CCGGCCTTCC 
TACCACCGGT CTTTGGGCGC AAAGATGCGA 
TGGTTTTCCC TGCCGTGGTG CCAGAACCAC 
TGTATAACAT ATACTACAGA GTGGCGGAGA 
TGACGCTGTA CACGACCAAT ATCACTTACA 
CCGTTGGGTC GAGCTCAGCT ATGCGGATGC 
TGTCCGCGTT AATTCCGCGC GGGTGCGTAA 
T^CACGAATT AGTCGAGATT TTCGAGGGGG 
TCAACAACGT AAGCGTGGCC GACGACATAA 
GAAGTCTCAG TTCTATATTT AATCTGGGCC 
ACACGAATAC CGCTTCCTGT TGGTGTGCAT 
ACATGTCGAT CCCCAAAATT ATGACGGTGT 
TCGCGGTGGA CAACGGACGA CACAGAGCGA 
C;U^CGAGCG CGCGGACGTT GTTAAGGAAG 
TAAAGCAAAT TGTTAACGAG GGCCTAGCTA 
TGTATTTATA TTTTTATTTG CAGTACGTGT 
ACCCGTACAT GGACCCACCG GGTCTGGTGT 
TCACGAAACT ACCGCACCCG GCTATTGTTG 
AGAACTGTAG CGTCCCCGAT ATCAAAACGG 
GCGATTCTGG TCAGCGCGCA CACTATGGGC 
TGGGAAATGT GTGTCCGGAA CCAGTAGTGC 
ACCGGACGGA GTGGTCGGAT TTTAAATCGG 
GAGGCGTTCT GTGGTCACCG ATTCATGAAA 
TAATTAACAC ATCAGTTTTT ACAAATGAAT 
AAAGTGTAAT AGTGCTTAAT GGTGATGCAT 
ATGTACGTTG TTTTTATTGG TCTGGTAAGG 
CCTATAAAGC AAAACGGTCA TAGAGTAACA 
TAGTTTATCC GAACAGTATG ACTGCACACA 
CAACAAGTCA GCCGGAATCG GTTCAAGTTT 
CCGTTATGGG CTTGTTTTTT TGCGTGGCTA 
TGCGGAGGGT GTGTTGTAAG GGGCGCGTTG 
CCGCGTATGA GATGTTGAAT GTTAGGTTGC 
TCACGTACAT AGCCAAAGCG CACGACGTGC 
AACAAATTTA TCGTTGGTTG CCATGTAGTG 
CGACATGGTT TTTTGTTTCG CGCAGTACGT 
TATTGCCTTG ATTTGATCTA TTAGATACGC 
AGACACCACA GTTTGGCTAG CGCTAAACGT 
AAACGCCTTG GCTTTCTCTG GCCCCAACAT 
GTCCACGTGG TGTAAAAAAC TAAGAGGGAC 
GGCCACTCCG GCATCTTGAC TGCTGTTGCC 
CGGGTGAAGG GTAACGATCG CTCGAATTAA 
GGCGGTGCCG GTTAGATCTC TTAGAAAAAA 
CACCTTGTGA TTTACCGCAA TCACCGCACC 
GGTGTACGCC GGGTCTATGT AAACGTGGAG 
TGACGTGCTC TTTACGCGGC ACAGATCCAA 
ATGCATGCTT TGAGCGGACG TGGCAGCGTC 
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4 65 01 CCCCATTAGC TCCGTGGAGA ATGCGCCTTC CAAAAAAAGA TTAGTCGTGC TCCGTACCGT 
4 6561 CTCATCAATC GTAATATAGG CAGGAATATG CAGGCGGTAG CACGGGCACG CCACGACAGT 
4 6 621 GTCTTGCAAG TTAAAATCAT CTTTATGATC CGGACATACG TAGTTCACCA CGTTAAGCAT 
4 6681 CTTTTCGTGG GCGTCCTTCA AATTAAGTAA AAAACTAGTT GATTTGTCAG ATGAGTTACT 
46741 TGAGGATATA AAAATCAGTT TTGCGTCCTT TTGAAGCATA AAACCCAGAA TCGCGGGGAG 
4 68 01 GGCATCTTTT TTAATGAAGT TCGCCTCGTC GATATATAGG AGGTGGAACG TCTGTCCCCG 
46861 GATGCTCTAA AGGCAAAGAA AAACACAAAA CGAGTTACGC GGGTATGGAG AACGATACGC 
46921 CTAAGGACAA AATCTCGGAA GCTGACTTTC AACAGTGTCA GGCGTTCTTT CACCGTCCCA 

46 981 TTAGAGATCT AATTTCATCT GGAGCTGACG CTTTAAACCA CTTTAGCCTA TCTGAATCAG 
47041 ACGGACATAA ATTGGAACGG ATTGTTCTTC TGCTTGACCT GGTGGGGACA GAATGTCTCT 
47101 CTTATACCAC GATCGCTGCA AAGAATGTCA AATGACGCGC GTCAACAGCC CAATATGTCG 
47161 ATTTCATAAC GTCTCTAACT TATACCAGTG TTTGGATTGT AAGCGCTATC ACGTATGCGA 
47221 CGGGGGACGC AACTGCGTGA TCGTGTACAC TCGCGAAAAT CTAGTGTGTG ATTTAACGGG 
4 7281 AAACTGCGTT TTGGATAATG TGCAGGACGT ATGTTCGTAC GGTCCTCCAG AACGCCGCGT 
47^41 ACCCGACGCC TTCATCGATC CGCTCGTGTC ACACGGCACG AGGGAATGTC TTAAAAGCGA 
47401 TATACTGAGG TACTTTGAGA CGGTCGGTGT GAAATCTGAG GCATATTCTA CCGTTGTCAA 
47461 GAATGGACAA TTGAATGGCA TCATAGGTAG ATTAATAGAC GCTACGTTTA ACGAGTGCCT 
47521 TCCGGTAATG AGCGACGGCG AAGGTGGCAG AGACCTCGCG GCGAGCATTT ACATCCACAT 
47581 AATTATCTCC ATATACTCCA CTAAAACGGT ATATGATAAT CTTCTATTTA AATGTACGAG 
47641 AAATAAAAAA TACGACCACA TTGTAAAAAC TATCAGAGCG CAATGGATGC GCATGGTCTC 
47701 AACCGGCGAT CCGTCGCGGG TCAGTGCGAC GGGTTGTTTC ACGTGATACT TCCGCGAGGG 
47761 TTTATCCTCG CGAACAATAT TACGTGCGGT GAACGGCAGC GGTTTTTTGC ACACACTTGG 
47B21 TTCGCTGCAT CTGGACGCAC GTCTAAGACT TTATACGTGT GGGGACGGGT ATTTCAAAAC 

47 881 ACCGACCCGG GCCGCGGGGA CGGTCCGTCC GGGCCGTGGT CCGGACTGGC GATTAGTCTG 
47 941 CCTCTGTTTA CCACAAATGG AAAATTTCAT CCGTTTGATG TAGTTATACT CAAGGCCGAT 
48001 ACGCCTGACT CTGGAAGCTC GTGGACCGTG AAGTTCTTGT ATATGTCATT AATTGCGGCT 
48061 TACAGAAACG CAATGCGAGG TTTAAAAGAT AAAGTTTCGC AATGTACCGA TGCCGCCGTT 
mil GACGGTGAGG TTCATCCTCT AACCGTCTTA AAAGAAGCGT TGGTATCACC GGACACTGCT 
4 8181 ACGCGACCCG TGTCCGCGTG CAACCCTCTA CAGATGTTGA CCGGACTCTT ACAGTCTAGG 

GTACGGGACG ACTACGTGAC ACACCACCGT GCGCTCGAAC GCCCAGGTAA TGTGAGGGGA 
CAAGTAATCG CCCCGACGCG CACCGAGATG CCAAACGGAT CGCCAAGTCG TGTAAGGCTT 
GGATTCCGCC CTCCCAAACA AGCCAACTAT CCAAAGACGT GGGCGCAGGC GCGTCACGTT 
48421 TTCTCGTCTC GCGCATATTA CGTGTGCGTA TATGATAACG AAGAACTAGA TACCAAGTGG 
4 8481 CAGCGGCAAG ATCCGCGGCC GTTGCCGCTA GATTGGTCCG ATCCGGTCGC GTACCTGTTA 
4 8541 GAGGGCGATT TGTTTTTAGG AGCCAAGCAG AATGCGTTTG TCGATTCTAT AGAAAAGACG 
tseol ?ScAGG?GTC AGAACTATAC CATTAAGCAA TTTTTTCCGG TTTTGATAAA TAGGGACAAC 
48661 GAAACAGTCG ACTTAATTAA GGAGCATTTT ATAGAGGCGT GCTTCGTGAT TAGAAACCAG 
Hill GTGTCAGAGA GGAGCGCTTG GGTAAAGGCG GCGCTGTTTC GCAACGATAG TAACACGTAT 
48781 TGGAAGGATG TTTTGGGATT ATGGGAGCAT GGGCCTCATA AGCTGGGTAC GGCTATAAAA 
tllll CTACcScAT CGGAACCTTG CAATGCCGAC GTAAACTGGA GCTGGCTCCT GTGTGACGAG 
HTol GATATAACTC GGTCAATTAG CGGGCAGTCT ACTGTTTGCT TAGTTGTCTC GCCTACCCTG 
4B961 ACCGCCTGGC TGGTGCTCCC GGGGGGCTTT GTTATTAAAG GCCGCTACGA CCTATCAAGC 
Holl GAGGATTTAA TGTTTGTGGC TTCGAGATAT GGCCACCCAG CGTCGTCACA TTCTTAAATC 

49081 
49141 
49201 



48241 
48301 
48361 



GTTTTTAAAC AAGGAATGCA TATGGTTGCG ACACCCGGGT ACGTCCGCGT TTGTTCGGGT 
ATACACCGCG ACCACTGCGC ATTCTGCCGT TTTTGACCCG CCGGTAACTA GCGAAAATGC 
4 9201 GATGTCACTT AACTTTTTAA ATGTTATGAT CGTAATCATG AAACCAAAAG AATTTGGCCC 
4926^ GTGCGTTACC GTGTACATGA ATGGAGATAT TCTAGATTTT TGTGCCACGG AATCTGTCGC 
4 9321 CATAAGGGAC GTGCCTGGTA GGGCGGACCT GTGTTTAATT CGTTTTGGTA CCCTTTCTAA 
TGCGCCGAGG AGCGTTCCGA TACCCGGGCC GTTGAACCCA CATCCGCGAG AAACCGTGCC 
CGGGCTAACA AAACAGGAAA TTATATACAC TTCGCAAACA GTGCCAAGAG GACAGATACC 
AGATGCCATA AAGGGGAAAG AGTTCCACCA AATAAATCCG TTTTTGTGGT TTGACGGAGG 
GGCGTTTTGG CAACTGTTCC TCTCTGTGGA TTTTATGCTG CTCTGTCCCG CACTCGACAC 
AGTTCCGTCC CTGGCCAGAA TCGTTGGGCT TCTTACACAG TGCGATAAGA GCACGTGTAA 
AATTTGTACG GGGGCCCACG TACACGTTAA CCCGTATCGC GGATACACGC CACCTGACTC 
GCAAGGGACC TCACCCTCGT GCCCCTGCCT TATCTCGTGC GGGGCCAGGC GCGCGGCGGA 
TGTCCTGGTT ACCGGACACG TTAATCTTTT GGGCCTGCTC TTTGACCCCA AAGCCTCCCC 
CAAAGTGACC AAGCTGCGTT TAAAAAGAAA CCCACGCCCG GTACCGATAG AGGACGCCAT 



49381 
49441 
49501 
49561 
49621 
49681 
49741 
49801 
49861 
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49921 

49981 

50041 

50101 

50161 

50221 

50281 

50341 

50401 

50461 

50521 

50581 

50641 

50701 

50761 

50821 

50881 

50941 

51001 

51061 

51121 

51181 

51241 

51301 

51361 

51421 

51481 

51541 

51601 

51661 

51721 

51781 

51841 

51901 

51961 

52021 

52081 

52141 

52201 

52261 

52321 

52381 

52441 

52501 

52561 

52 621 

52681 

52741 

52801 

52861 

52921 

52981 

53041 

53101 

53161 

53221 

53281 



GTCGGGCGTC 

TCGCCTGCCG 

CTGCTTACGT 

CTCGCGTACA 

CTGGATAGCG 

ACGTAACCCA 

ATCCATGTTT 

TGTAACACGT 

TGGTGTGTCT 

TGTTGTTGAA 

GACAGGTTGG 

TGGGTCCGTT 

CGACAACATG 

TGGCTAGAAA 

TACGTTTGAA 

AAGCATGGGT 

CGAAACCAGT 

CGATAGCTGG 

GAACGCCGCT 

CAGTTTAAAC 

GAATCGCGCC 

CTCCCAAAAC 

GACGACACGT 

CTGCCGTATG 

AAAATGGCTG 

CCTTTACCGC 

TGGCTATATG 

GCCCCCCAAA 

TACGTGAACG 

TTGCGCGCAG 

AAGTGCATGT 

GTGGGCGGAG 

GCTCAACGGT 

CTCGTAAGCA 

GGAGCTCGAG 

CGACAGTGCC 

CAGCCTAAAA 

TGAAAAACAA 

TGACGCTGTG 

TCTAGGTACG 

TATTAGGTGG 

CCAAGGTTCA 

GTAAGACGAT 

CGGACTGGAC 

GCCGCTGCGT 

ATATAGTACG 

GTTTCCTAGA 

TGCTAAACGC 

TGAGGGAAGG 

CAGAAAATAT 

ATCTAGGGTC 

AACTCGGGTT 

ACATTTACCG 

AGGCGCATAT 

TATCGTCCCT 

CATTTCCCCA 

ATTTGCGTTG 



ACGGCCGAAG 

GATTTAGCCA 

TCTTATTGAA 

CGATGGTTCC 

TGGCGATGAC 

CGTGAACGTT 

TCCCGTGACG 

CGTCCGTGGT 

TTTCGGGTGC 

AGGGCTCAAG 

CTAAAAACTG 

GGGCCGATTG 

TTTGTTTAAG 

TTCGGGGCTT 

GATAATCGTC 

GGGCCATTCG 

TTTCGCAACC 

TTAACAACAT 

TCGTTAAAGG 

TAGTTGAAAC 

CGGTTAGAGA 

TAACACGCAT 

GCGTTAACGT 

GACAGGACGA 

TAGAGGGCTT 

TCATATGGCT 

AACGCAATTT 

AGGACGTAAT 

CCGGTGGGGG 

CCATCGCTCG 

TGTGTCACCT 

TCATCGCATT 

GCACGAGTCG 

TGATAAGGGA 

TCGGAAATCA 

GTGTTTAACG 

AAGAAGGACG 

AGCGAATTGG 

GCGGACGCGG 

CTTGACAATG 

AAATTGGAGC 

GACTGACGTC 

CGCCCAACTG 

CGCCGTGCTC 

TAAGCATTTT 

CGCCTGCCGG 

GCCGTGCGTA 

GGATCTTAAA 

CGTTAGTTTT 

ACTGATAAAG 

CGCCTCTTTA 

CGTGCAGCAC 

GCCGTCGTGT 

GCTAAATCAG 

GGGTTATAGT 

GCAGGGAGCG 

CATGGTGCCA 



GGACCGAGGT 

GTCGCGTGAT 

GCACGTCGCA 

GTTTTCCTTC 

CTCGGAGAAA 

TTCAACCGAC 

ACGCGGGATA 

TTCAAATAAA 

TTTGATAGAC 

GCAACAAACC 

CCGCAGTTCT 

GTATTCCATG 

GTAGTTTGTG 

TCCCAACGTA 

TAAAACAGAA 

TTCGGTCTTG 

ACCGTCCGTG 

GTTCCCTTCA 

CGTTCAGTTA 

ACCTCTTAAT 

CTGTCCGCAG 

GCAGCGTGTC 

AAAGGCAAGT 

ATGGAATTGG 

GTCAAATCCC 

TCTGTTTTAC 

TGGTATGAAG 

GACTTTTGTC 

TCTTGAAACT 

TCTCGGGGAC 

GTACAAGCAA 

AGGCGGAGAC 

GGGAGATTTT 

ACATGGCCTC 

ACAAAAAACT 

CGCAGTATAA 

ATCTGGCGAC 

GATTACTAAA 

TTCGCGACCT 

AATCTGTTCC 

GTCTGCCCAG 

GACAGAACAG 

AAGATACCTA 

GGACGCGGCT 

GGCAGCCGGC 

GAGAAACATC 

CCATGTCGCG 

CACGTGAACC 

CTAAATAATA 

GGGGAACTGA 

CACACGGGAA 

ACGTACCATT 

CTCCTCTACC 

CCGTTCCAAA 

TTGCTAGCAT 

TTGGCGGACG 

AGAGTAGTCA 



GCAACCCACT 

GCTATACGGC 

CACATCAGCG 

TTTATGTTCA 

ACGGCATTGG 

GAGAGAAGAA 

ACAAAAACGC 

CCAAAGTGTT 

GCGATAAAAT 

CCGGGCACGT 

GCATAAACGG 

TACTGGTCCT 

AGTTCCTCGG 

GGGGCCGGCA 

CTATATGCAT 

TCGTTGCCCG 

ACCACGCGCA 

AGTTTTTTGA 

GCTTTAGACC 

AGTTTTCTTC 

CCGGAAGGGT 

CTGGGGCGAT 

TCCTCGAATT 

GCGCTCACGT 

ACGACCTGGA 

GGTTCCCGGT 

GGACCCATAC 

CATCATGTAA 

GAACCGTCCC 

GTGGAAGCGG 

AACGATACGA 

GGTGCGAGAT 

GTTTTAATCC 

GGCAGCAGCT 

GTCCATCTCC 

GGGAACCAGG 

CGTTGTCGGA 

GGGGTTTAAC 

CAAGGACGAG 

CGTGGAAGAA 

AGTGTGCCCC 

CATCGTCCCA 

ACAACTGGGG 

CGTATGGTGT 

GTGAGTTTTT 

CCCTGAACCG 

CCCTGATATT 

CTGAACGGCT 

TATGTGGCAT 

CAACTGCGTA 

CCCCTTGGAC 

TTAAGGCACC 

GGTGTCTGCT 

TCACTCCACA 

GCCTAGAGAA 

CTTCCTCCGA 

TCGCCGAGAT 



TCGCTACCGT 

TGCCAGAACT 

AACTCGGCCG 

GGTTCTTCGC 

CGACGTGTTT 

CGCTGATAAT 

TGGCTTTCTG 

GCTTAAACGT 

AAAACGTGTG 

AACTCCCATT 

GGCACTCTAG 

TGGCGTTTGC 

TTAACCGGGG 

TGGGTTCTTG 

TAACCGCGTG 

ATATAATAAC 

AGTTATTTTG 

ATAACGGGCA 

TGTGTGACAA 

TGGTATCCAA 

TTGACTTTGA 

ACTGCGACCA 

CACAGGGTGC 

TAAGGAAAGA 

AAGGTTTAGA 

CGTTCTGTCG 

TCTTACCGCC 

TTAAGTACGT 

CGCCGTTCGA 

ATGACGCATA 

TTTCGATTCA 

ATATAACGTC 

CCCTGTACAA 

AAAAAAATGT 

GTATTTGACA 

GAATCGCTGC 

ACGCTAGAAA 

AGGAAAAAAA 

CTGTACGGAG 

GAGTCCCCCA 

AAAAGCCCTT 

TATAACCGTC 

CCAGTGTAGT 

GGTGAGGTCC 

TTACGAGTGC 

CGGGGGTGAC 

CCCGCAGTTA 

GGCCGTTGAA 

CGTTCACTGT 

CGGGAGACTT 

CGGAGTGATG 

GGCCAGATTT 

GTCGTGCGCC 

ACTCGGTCTC 

ATATCTTCAG 

ATCCGCCCAC 

TTTTTCTGTT 



GGGCCCTCAT 

TAAAAAGCAT 

CCGTCCGGGG 

CGGAAACCAC 

TTGGTGGGCC 

GGCGACCACG 

CTTAAACGTC 

GGCAAATAAC 

CATGATTAAT 

TAAAAATGAC 

GAAAGCTTCG 

CATTTGGACC 

CAACGTAGCG 

AACTCTGCGG 

AATAACTCCC 

GGGCGGTTGC 

AAGACGTTCG 

CCCCGAAACG 

CACTCCGGGA 

CGTTCTGCCG 

ACACATTCAC 

TGTTAACAAC 

CTTGTTTTAT 

CAAGTTGGTT 

GCCCGTGGAT 

GGAACCAGAG 

ACATATGTAT 

TAAATTTTTA 

GGCCTCGCGG 

CCTGTCCGCA 

TGAAACACAC 

TAGTGTTCGG 

CATTGAAGGG 

TAATTAAGTC 

GGTTTGGGGC 

GGTCGTACAA 

CGTCGCTGCG 

TTGAAGAGTT 

AACTGGAGAT 

AGGACGACAT 

GATCCCATTC 

ATTAAAACAC 

CACCAGGCGA 

ATGTCTCTCG 

ATTTTTAACG 

CGTATACTAT 

ACAGGGAATC 

TTCTCTGAGC 

GACATCAGTC 

ATGATCGGAG 

GTGACCTCCA 

ATCTGTAAGC 

GGGGGCCCGC 

ACAATTGACA 

CCAGCTGACC 

CCATTGTTCT 

GCCTGGGACG 
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53341 TTCCACTCGA TTTAGGCATT GACTCATCTG GCCACGCGCC AGCTATTCCC CTGAGAGAAG 
53401 CGTACAGGCG GTTTTTTGCC AACCAGTGTA GTTTATATAG GGCGCAATAC AAAGAGGATG 
534 61 r-GTTAGAAAA CGCATCCTCG CGGCTGTGTA ACTCAAAACT TAAACTAGTT CTCCAGAAGC 
53 521 TCTTGGTCAG GGACTACTTT AGTCATTGCG GAAACTGCGG AGATCATGGA TTTTTTCTCA 
53581 GATGAGCCGA TGGTTCAGGA GATGGCGCTT CTCGACATCG ATGAGCAGCA GCGGCTCCTC 
53641 TCGAAAATGA GCCTGGCCAA CTTTTTAAAA CATGAGCGAG TAAGGGCGTT TTTTAGCGAT 
53701 AACAAAAAGG AAATAAGCAT GCCGGCTATA CGGTTCGTGT ATAATTTTTA TCTATTCGCC 
53761 AAGGTGGGAG ATTTTATCGG CAACACCGAC GTGTACGATT TTTACGTCAC TTGCGTGTTC 
53 821 AGGGGGAGGC GCCTGACGCG CCTGTCGGAA GTGTACGACG CGTGCCTAAA CATGCACCCG 

53 881 CACGATCGAC ACCACGTGTG TGCATTGATA GAACAGGTCA CGCGCGGCCA A/^CATCAAT 
53941 CCTCTGTGGG ACGCTCTGAG GGACGGCATA ATTTCGTCGT CAAAATTTCA CTGGGCCATA 
54001 AAACAACAGA ATTCGTCCAA AAAAATTTTT AACCCGTGGC CTATAGTCAA CAATCACTTT 

54 061 GTAGCGGGCC CGCTCGCGTT TGGACTGCGT TGCGAGGAAG TGGTTAAAAA GATACTGGCG 
54121 ACGTTGCTGC ATCCAGGCGA GGCGCACTGT GAAAACTACG GATTCATGCA GAGTCCTCTC 
54181 AACGGGGTTT TTGGCGTCTC CTTGGATTTT GGAATTAACG TCAGGTCTGA CCCAAAAGAC 
54241 GGTTTGGAGT TTCACCCAGA CTGCAAAATC TATGAAATAA AATGCCGGTT TAAGTACACT 

543 01 TTTTCCAAGA TGGAGTGTGA CCCGATTTAC GCTGCGTATG CTAAACTTTA TCAGAAGCCC 
54361 AGCATGCAGA CGCTTAAGGG GTTTTTGTAC TCCATATCTA AACCGGCGAT CGAGTTTGTC 
54421 GGAGAGGACA GGCTCCCCAG TGAATCGGAC TATCTTGTGG CATATGACAA AGAATGGGAG 

544 81 GTGTGTCCGC GGAAAAAGAG ACGCTTAACT GCAGTACACC ATCTAGTTAA AAAGTGCATG 
54 541 ATTCACAACT CTACGGCGCC TTCTGATGTG TATATATTGT CAGATCCGCA GGAAACCGGA 
54 601 GGCCAAATTA ATATTAAAGC TCATCTGAGC GCCAACCTAT TTATAAACGT CAGGCATCCG 
54 661 TATTATTATC AAGTGTTGCT CCAGTCTCTC GTTGTACAGG AGTACATCAG TCTCTCCAAG 
54721 GGAACTAAAA ATTTGGGAAC CCAGAAAAAC TTTATAGCGA CTGGCTTTTT TAGAAAACGG 
54781 CAGTTTCAAG ACCCGAGCTG CTGCACGATC GGTGAATTTG CCCCGTTGGA TCCACACGTA 
54 841 GAGATACCGA CCCTTTTAAT CGTGACACCG GTGTATTTTC CCAGCGTGGC CAAACACCAA 
54 901 CTGGTGAAGC AGGCGACCGA ATTCTGGGCG GCTAGTGCTC GTGAGGCATT TCCCGAGCTA 

54 961 CCATGGGATT TATCCTCTCT GTGTGCAAAC GCCCCACCAA CACCGTAGAT GTGAAGGGGG 
55021 AGCCCATAGA TGTATCCAAA GAATTCGATC CTATTATAGG AGAAGAAAGC ATTGTCTTGT 
55081 TAACGGCAGA TGGGACTGCC CCCGCGGCGC TGTACAAACC CAAAACCAAG CCATCCAAAC 
55141 ATAAAAACAA TAAATTGTCA GATTTTGTTT AAGCATTCGT ATCTTTATTG GAGTGGGGTG 
55201 GATGGTGTGG GGTTGGGAAG GGAATGGGAT TGAGGGGAGG ATGAAATGCT AGAATCATAT 
55261 GTATTTTGGA TATGCATCCT CGTAATCGCT CACGTCTTCT TCTGTTTCAT ATAACGCGTT 
55321 CAGTTCCGGG GAGTACTTAA CTTTTGTCGT CTTCTCTTTA GGGCCTTGAG GAAGGGGTGC 
55381 GTATTCGGGC GACTTTGCGC ACCATTTCCA ATCGTTTCTG ATAATTCTTA AGATGATGGC 
55441 TAGGATGCAA GTGATCGGAA TCATAGAGAT GTTCACGGCT ATGACTTTGT GCAGGCTGGC 
=;^S01 AGATATGAAT ACCTTGTCGT ATCGAACGAC GGGAAGAAGC AGAATCAGGT AACTGACAAA 
55561 AACGCCAACG TAGAATCCGA TCTGGTGCTT TAGGTACTTT ACTAGAAATA GTTCTGTGTT 
55621 GATATACCAA GCTATTGTAA GGACAAAAAA CAAGTTTATT GAACCAAAGA CAATATCGGA 
55681 AACGAGCATA TAGAAACTGT TTCCAATAGC CATCATCGTT CCGAGGGAGA ATACTAACAT 
55741 CTCCATGGCT ATTAAGGACA GATAAAGATT GGCGCCAATC GGTTTTCCGT AACGCAACAG 
55801 GGTGTCTAGC AAGCTATTCT CAGGAATCTG CTGCTCTAAG ACGCGTAGAG ACCAGGTGGC 
55861 CGAATTGCAC GATATCATAG CCTGGACGTG CGTAAACGAT AGGCAAAAAT GTATACAGTA 

55 921 TACAAACGAG GCTAAAAGTA TATGTTTGTA CGACAGAACG TGAATAAATA ACTGGATGGT 
55981 CCACAGCCTC AAAATAGACA TAAAAACTAC ACTGTGGGAG CCTATGAGCA CGATCCATGT 
56041 CTGGAGGCTG GTCATTGTTG TCGCGTGCAC GCGCTTGGCC TTGAATATGG CCAGGATGGC 
56101 CCACGCATAG TAACACAGCA GATATCCGTC CACCAAAAAC GCCCACGAGA TGTAGACAAA 
56161 CATTTCTGGT GCTTCCAAAA ACAAAGTCGG CGTCAGATGT TTAGCAGAAC TTCTGACCGT 
56221 TAGGTTTAAC GCGCTGTAGT TGACCAACGT GTTAAAGTAG CACGGAAATC CAAGTCCCGG 
56281 GAATGTGGCG GTCAGTGGCA CTACCGCTGA CATTATAAAC ATAAGTCCAA GAATAACCAG 
56341 TAGTTTTACC CATGACGATA AGATAAACGA GTCGCTCCGT GAAATTTTCA TAGTTCAAAG 
56401 AATCTGCTCG CGGGTGCGTT CGGCAAACTA GCAGTAAAAT CAGCAAGTGG CTTACATGTT 
56461 ACGGATTTAA ATAGGCTCCG CTTTTAAATA TCACATTCTG TGCAAAGGAG TTGAGCCACG 
56521 GCACCATGAA CGCCCGGGAG GTGGCACTCA CGGGACATGT TTTGCACATA TCGCTGCATA 
56581 GCACGCACGA GCGCGAGAAA TTAATAATCT GGCAGGTTCA TTTACTTGTA TGTCAACAAT 
56641 GCGGAATTCA GGGAGATGCC GCATATCTAT TTGTCACCGA AACATTAAGC AATACTGACT 
56701 GGGGAAATAT ACCGGCGATA AACCGTCACG CACCGTCCAT AAATGAGCAC GGCCGTAATT 
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56761 

56821 

56881 

56941 

57001 

57061 

57121 

57181 

57241 

57301 

57361 

57421 

57481 

57541 

57601 

57661 

57721 

57781 

57841 

57901 

57961 

58021 

58081 

58141 

58201 

58261 

58321 

58381 

58441 

58501 

58561 

58621 

58681 

58741 

58801 

58861 

58921 

58981 

59041 

59101 

59161 

59221 

59281 

59341 

59401 

59461 

59521 

59581 

59641 

59701 

59761 

59821 

59881 

59941 

60001 

60061 

60121 



ATATGCAGTG 
AGCCCGGTGC 
GAGCGTTGGA 
CTATGGACTT 
TGTTTTCTCA 
ACTATAGGGC 
TTCTTGATAT 
TGAGTATCGC 
GGGCCCTGGC 
TAGCGCCAAC 
CCTACTTTTC 
TAACCGCGGT 
ATTCAATCTT 
TACCCGTTCG 
AAAGCGTTGC 
TGTGCCAGCA 
CCATCTGTTC 
CCATTTTTAC 
TATTCATCAA 
GTTGCCGTAG 
ACAGCAAAAG 
CCACCCGCCT 
AGCCGGGGTT 
GTGGCTGGAC 
TGGGGCGCTG 
GCTGCTGCCA 
TGTTACGCCT 
GCCGTTACCG 
TGTTGTGTCC 
GTTTGAAAAG 
GGTTATTTGA 
GCAATAAAAT 
ACGAAAGGAT 
CAACAGCTTA 
GTTTAACTCG 
CGCGGAATCA 
GATAAATGAT 
TGTTACGCTC 
TATGTCCAGT 
AAACGTTGCC 
TCCTCGAACA 
TTCCAACATC 
TTCAGGCATT 
CATCGGTTCC 
CAGACAGCCT 
GAGAGTGATT 
AGAGAATGGA 
CCACGGGTGT 
TGAGCATCTC 
CTTTAGGGAT 
GCAACGTATT 
TCTGAAGCAA 
CAAGGTTAGA 
TATTTGAAAC 
TGTTTTTGTC 
TGCCGCTCAC 
GGTCGGTTTT 



GGAACTCCGT 
GGTTAACGTA 
TCACTCGTGT 
TAGTTTGGAT 
CTGTTTAGTT 
ATCGAGTGCC 
CTTTAATTGC 
CACCGCAACA 
CGAATTTTTT 
GTTAACGTAC 
CGCGTTTCCA 
CCAAGAACTC 
AGGGGATCAG 
TTGGCCGCTT 
CCGCATAGTC 
CCTAAAATGC 
CAGCGACCTT 
GTGGGCCGCG 
GGCTCCCCAG 
GAAACAATGT 
GTCGTGACCC 
GACTGGCTCA 
TTGTATGCCG 
TGTCTAGTAG 
GCGGGTATCT 
CGCGCGTGGG 
CCACAGTTTG 
GTTGACGTTG 
AACGGGGAGG 
GTACTATGTT 
AAATATTTTA 
ATGTTAAAAG 
TCCAAATAAA 
TGGAGTTTGA 
TGGTATGCCG 
TTACACGTAT 
AGGGGGGGCT 
GCGCTCGCCG 
GCGTCGATCA 
GACTGAAGCC 
CACAGCTTAG 
AGTCGCGGAA 
ACCGCCTCAG 
AGGCTGGCTA 
GCTGTCCACA 
TATGAGGTCC 
AACCGTGTCC 
GAGGTTGTAC 
CATAAACGGA 
GTGACATTCG 
CTGAGTGAGT 
CTGCGTTTCA 
AATTGTATCG 
CTGTTTCCGG 
CACAATGTTT 
TGTTTTGTTC 
TTGAGACGAT 



ACTCGTTTAC 
AGGGTCAGCG 
TCGGTGCGCG 
TCTGTTGTGT 
TCCAACATTG 
ACGCAATGCC 
GAATCTCGTG 
TTTCACGTTC 
AGGGCGCTGC 
GTGTGTCCGG 
CACCTCCCGC 
CCAAAAGCAC 
CTTCTCATAC 
TGGGTGGAGG 
GACCCACATT 
CACCGTGCGT 
CGCCTCTTCA 
GCGAGCGGGT 
CTCAGCGCAG 
TACGAAGGTT 
GTCTTGAGGC 
AGTGTGAGGT 
GACTCGCAAG 
AAGGCGCGAC 
TACCCACCCT 
TAAAATCGGC 
TCGTGACAAA 
ATTTTATGTA 
GTACGGAGAG 
TATTTTAAAA 
ATACCGATGC 
TCTCCGCGTG 
TGGGTTTCAG 
ACCGCACTAG 
CCAAGGTCGA 
ACCTCTCGTG 
TATCGTAGTT 
GCTTAAACAG 
TGTCCTCGTG 
TGGTGCGCAC 
GCGTATTGGA 
AAACTGGATA 
ATCGGTGCTG 
ATTTGTTCGC 
TACAGCTGGT 
ATGTGAGTCA 
TGCGAGTAGG 
GTTTGAAACA 
GGGACGTATT 
GATAGGTTAA 
ATATTAATAT 
ATGCATTGAA 
AACTGCTCGT 
AGACCCTGGG 
TGAGAGACGT 
TCCGCGAGCT 
TGCTCTAAGC 



GGAATCCCAT 
AGCCGAATAT 
TGACTGGAGC 
CCCCAACCCG 
AAGTTTACCT 
GAAAAAGGGC 
AAATACAAGT 
TCTGGGTGGA 
ACAATAAGTT 
GGGCCACTCC 
TCGTGTTTGG 
AAATTGCGGT 
CTGGCATTTC 
ATGTTAACTT 
CTATAGTAAT 
TTGTTAAAAA 
TCCAAGAGGA 
ATACCTGGGC 
CTGTAAGTGG 
AAAAATAACA 
GCATTTGGGA 
CTGCTCCGCG 
AGACCCCACC 
GTTGTTGCTT 
TTTTGCCAAC 
GCCCATATGC 
GCGTGGACCA 
CTACCTACAG 
TCACGCGGAC 
CAGAGTCTGT 
GTTATGAAGG 
GCCCTCTTCT 
TAAGCCTTTC 
GGGCGGAACG 
CGCTATCACT 
TAGGTACCTG 
TTCCAGCATA 
CTGCACGCGG 
TTCGACCACG 
GTGCACGCCT 
AGCCATGCAA 
ATTATAGGGC 
CTCCCCCATT 
AATATTTGTG 
CTGCAATATC 
GTTTGGCGAT 
CGATGCTCGT 
TTTCCCCTTC 
GTGAAAAAAA 
TGGTTGGGTC 
CGTGTCTGGG 
TCTTCTTTAC 
TGATCTGGTT 
TCGTACTTTC 
CTGTGACGAA 
TTATCAGCAA 
GAGCCAATAT 



CATTCAATTG 
GGTAATAGTG 
CTATCTTCAT 
CGAATTTTGG 

taaaacaacg 
gaaagatggc 
tgccgggcag 
cgaggcgtgt 
gttcggcgac 
ggagggaacc 
aagaccgcga 
acactggccc 
acct;u:^aaag 
gagtctctgc 
cataaaattt 
tgagttagaa 
atacaaccgg 
ggccattgat 

TTTCTGCCCA 
GTTCATTTCC 
CTTCCCGTAC 
TCCGTGTTTT 
AGGGAAGCAA 
AACAACTCAG 
AGGCGGTGTG 
CCTCCCCTAC 
ATCTGCTGGT 
GAGGCACTAT 
AATATACGAC 
CTGGGGATGC 
TGTCTGGTAA 
TCTCCAACGT 
ATGTCGTCTG 
CGCAAAATGC 
TCGATCCCAT 
GCCCGGTCAA 
GAAAGAATGC 
TCAGAAAAAA 
CACGCAAAGA 
AAATCAGGAA 
ATGCTATTTT 
CCTCTGGACG 
AGACGCTTCA 
GCCTATGTCA 
GTGTAAACTT 
TATGTACGTA 
TCGCTGCCCC 
CCACAGGCTA 
GCTGTTTACT 
GCGCAGACCG 
CCTGCCTCCC 
GTAGTCGTCT 
CGTCAGGCAT 
CGTACCGGTC 
ATCCTCAACA 
GTTTAGGAGC 
GTCTTTATAC 



TTAAGTCGCC 

GGCTGCGAAC 

TGCGATACCA 

TTCTCAGAGA 

GGCGGGTTAT 

GCATTGGGTA 

AAGTACACCT 

ATGTGGAACG 

CGGGAAGGCG 

CCCTTCCCCC 

AGGCTCGACG 

CCGTTTTVAAG 

CCAGGTACCG 

GAGACGACAG 

TCATCACTGT 

TACATAGCAA 

TTACTTGCCA 

AAAACAACAG 

TCACTAAATA 

TTTCACAGGA 

AGGATU^CTTC 

TAAAAATACC 

TU^CGGGACTC 

TGTTACCGAT 

TTAATTTTTG 

CGATTGACTG 

ACAAGGAATG 

GTGTTTTTAG 

AATTAGAGAA 

TGTGTAGGCT 

AAATTGTGCC 

TCTCTGTTAG 

CGTTTTCGTC 

CTTGGGGTGG 

ACAATACAAA 

GCGCGGACAG 

AGCAGAGTTC 

CGTGCGGATA 

CAAACCCAGT 

CGTTTAGTCG 

TTTTATGTAC 

AGCATTCTGT 

GTATTTGGTC 

TTTATATACA 

AAGGAAATGA 

AACGGGCCTA 

TGATTGTCGA 

GTTAGCTCTT 

ACGGAGCTGC 

GATAAAGCCT 

TCTGGTAAAC 

CGTTCTTTCT 

TTGAACACGG 

CCAAAGAGCT 

ACGTCTGTTA 

TCCCTCTTGG 

ATGTCGTTGC 
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60181 ACGTGCTACC GACCAGCGCT TTTAGAGGCG CGGTATTTAG TAACTGACAT AACTTTGCGT 
60241 GCTCCGCGGT CCTGTGACAT GTCATAATCG CGGTAAATAA CCGGTACAGA GGGCTATCAA 
6 0301 AGGCGACGCG TCCATTGTAA ATTATAGGCC GTGAAACAAC TTCAACTGTT ATATCTTTTT 
6 0361 GTCTGTAGGT GCCGAGATAA GCAGCCTTAT CTCCGTTAAA CTCCACAGAC ACCTCGGCGT 
60421 AGTCTGGAAT ATAAATAGAC GTCACATCGG TCATTATCGA GGTAATCTCC TGTGCCAGTT 
60481 TGTCGACAAT CCGCGATATC GTAACGTCTC CGGAAGCGTA CGTTTTCCTC ATCTGTTGAC 
6 0541 AGTGTGCTTG TATCCTGGCG ATCGTGTCGC CGTCGGGGGT GCGCCTGACA ATAGGCACCA 
6 0601 AcbsAAGGGC GACCACCCAG TCGACGTATC GTTCGTACGA TAAAACAGTC CTGGTGTGTA 
60661 GTACCTGATT AATAGTGTTC AGTAACAGTT GATCGACTGT TAGACGAACG GTACGTGCCC 
60721 ACGATTCAAA CGTAGACCGT TTGAATTCCG GCGTCAGGTG CTCGACACCG CGCCAACGGG 
60781 TCTTCAGGGT GGTCTCGAAG TTGGTCCAGT CCTTTAAGAC GTTGTCGTAT GTGCAGCTCC 
60841 CCAATGCCGT CTTGTACAGG TGTATGAACA GCTGTCTTCC AAAGATCCCT GGATTTCGAA 
60 901 GACTGGAGTG TAGCGTTTGA CCGCGAACGT ACGCGTACTT GCCCTGTAGG ATCTCGAAAA 
60961 GCGAGATGGA GAGTTCCGTC GGATGGACCA AACACGTCGA ACCAAATCCG GGGTTCATCT 
61021 TGAACATGAC GTCCGATGCC AAAGTCAGGG GTGTCGTGGA TCACGTCAGT CGCCTGTCAA 
61081 ATATAACTAC CAGCCCACCG GAAATGGGTT GGTACGACCT GGCCTTCGAT CCGGCTGAAG 
61141 ACTCCGGGCC GTTCTTGCCG TTTACCGTTT ATCTAATTAC GGGAACTGCT GGTGCTGGGA 
612 01 AAAGTACCAG CATATCGGCC CTGTACCAAA ATTTAAACTG CCTGATCACG GGCGCGACCA 

612 61 CCATAGCCGC ACAGAACCTA TCGCGTCGCC TAAAGACGTT CTGTCCCACG ATCTTCAGCG 
61321 CTTTTGGCTT TAAGAGCCGA CACATCAATA TAGCCGTCAG AAAAGCTCAT CAGACCGGAG 

613 81 CCGTATCCAT AGAGCAAATT CAGCAACAGG AGCTATCGAA GTATTGGCCG GTTATAGTGG 

614 41 ACATTATGAA AGAGGTTATG GCGAAAAAAC CCAATGGCAT GTACGGGACT ATATCCAACG 
61501 CGAATTTTGA AACCCTCTCG AGAATGACCG GACCGTGTTT ATGGACTTCC AATATTATTG 
61561 TAATCGACGA GGCCGGAACC CTGTCCTCTT ACATACTCAC CACCGTCGTG TTCTTTTACT 
61621 GGTTCCTAAA CAGCTGGCTA AATACCCCTC TTTACCGCCA GGGGGCGGTT CCGTGCATAG 
61681 TATGCGTCGG TTCGCCAACG CAGACAAACG CGTTCCAGTC AACTTACAAC CACGGGACGC 
61741 AAAAGACGGA GATATCGTCG TGCGAGAACA TCCTAACATT CATGATAGGG AAGAAGGTCG 
618 01 TATCTGAGTA CGTACACTTG GAGAGAAACT GGGCGCTGTT TATAAACAAC AAGCGCTGCA 
61861 CCGATCTGCA GTTCGGACAC CTTCTAAAAA TTTTAGAGTA TAATCTTCCC ATTCCTGACG 
61921 AAGTCATGAG TTACGTAGAC AGATTTGTCG TTCCTAAAAG TAAGATTATG GATCCTTTAG 
61981 AATACATTGG CTGGACCCGA CTCTTTTTGT CACATAGCGA GGTAAAGGCG TATCTAACAA 
62041 ACCTACACAC ATGTCTAACG CTAGGGGGCG ATACCAGGGA CACGAAGCTC TTTACCTGTC 
62101 CCGTGGTGTG CGAGGTGTTT GTGAAGCCGT TTGAGGAATA CAAGCGGGCC GTCAACCTCA 
62161 CCAACCTCAC CGTGACCGAG TGGGTGACAA AAAACCTCTT TAAGTTAAGT AATTATTCGC 
62221 AGTTTGTGGA CCAGGACATG TCCATAGTCG CCACGGAATC CACTGAACGT TCGACCCAGG 
622 81 TTACCTTTAT CACCAAGTTT GTTAAAT^CA GCCACGTGTC TTTAAATGGA AAAACAAAAA 
62341 AATGCATATG TGGGTTTCAG GGTACGTACT TTGAGTTCAA AAGAATCCTA GACAGCGAAC 
624 01 TCTTCGTGGA GACGCATTCG CAAGACCGTC CGGAATACGT ATACGGTTTC TTAAACACGC 
624 61 TACTGTACAA CGCCATGTAC TCGTTTCACG CGTACGGCGT GACTAGGTCA CACGAGAAAT 
62521 ATCTGCAAGA CCTAAAATTT GCACCCCTCC CGGCCGCTCT GGCAACCGGG CGTGTAGACC 
62 5 81 TTCAAACGGT TCGTGAAGAG TTAAACCTGG AAGACGACAT CTTCTACCAC GTGTGCAGTC 
62641 CCCCGCCACC CGCGGGTATC ACCTCCCTCC AGGTTTTGGT CGACACGTAC TGCGCCCTAA 
627 01 AGGACGTGTT CGCCTCCAGA ATAAAGGTGG CGTGTCGCTG GTTTGGCGGG GAGTTTGAGA 
627 61 AGGAAACGTT TTCCGCGTTT ACGGTTAACA TGGTCGTAAG GGACGGAGTT GACTTTGTCT 
62 821 CCCCTTCAGA ACGTCTCAAC GGGCTGTTGG CGTTTGCATC GACCGTTGAA TCGTATAAAA 
62 881 TTAAGGGGTA CACGTTTTTA CCGGTAGCGT TCGGTCGCTG TCAGGGTTTG CCACTCAGTG 
62 941 ATGACCTCAG GAAGAAGATG CCCTCCCTGG TCGTGCAGGA CTCTAGCGGT TTTATCGCGT 
63001 GCCTAGAGAA TAACATAACC AAATTGACCG AAACCATGGA GGACGGGAGC GTTTTCCAAG 
63061 TGTGCTGTGC GGGGG ACTAT GGGGTCAGCT CAAATTTAGC CATGACCATC GTAAAGGCAC 
63121 AGGGAATGTC GTTGGAGCGA GTAGCCGTAG TATTTGGGTC CCACAAGAAC GTCCAGACAA 
63181 GCCACGTGTA TGTAGCAATA TCAAGGGCTG TTAACTCAAA TTATTTGGTC ATGGACAGCA 
63241 ACCCCCTTAA AACCCTCCTC AGAGAACCAG TCGATAACAC CTCCGCCAAG CATATAGTCC 
63301 GCGCCCTCCA CAACCCAAAC ACAACCCTCA TCTACTAAAT AAATAAAAAC AAGCAAATGG 
63361 AAAACATTGT GTTTTTATTC AGTCCAACCA CGGCCACGGA TAGTTGTCAT TTCCACACAC 
63421 CGGGGGGGTG GCCGCCATAG TTTGACGACC AGAAACGCCC GGGCCCTCTG GGTTAGAAGT 
634 81 AGCCCCGCCC CCCGTGGTCG AGGTCATACG TCTCTTAGGG CCCTGGGTTG TACCAACATA 
63541 AATCACGTCG TCATCACCGT TGCAGTCCCT GCGTGACGCG CGCCTCTTGC AGGCGGCACG 
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63601 AACACCGCCG CTTGTAACGT GATCTGCATC 
63661 AATCACATCA GTGGCGCCCG TGCCCTCATC 
63721 GCCTCGGTCG TCCACTTCGT CGTCATCTGA 
637 81 TCCAGCAACA ACAGGTTCTT CTTCATCGGA 
63 841 TATATCACCT GACCAGGCGG GCGAGGCCCG 
63 901 TTCATCAGTG TCCTCGTAGT CATCTATAAA 

63 961 CTGGGGTGCG CGGTGTTGGC CAGTAACCAC 

64 021 CAGTGGCGCG GGCGCGCTCG GTGGTGTGGT 
64 081 CCGGCCGCTC TGGTTAAAGG ACTCACCGTC 
64141 GGAGACCGGC GGTTCGTCTT CTTCATCCAC 
64 2 01 GTCGGTTTCG TCTGGCGTTT TTGGAGTAGT 
64 2 61 AGGTGCAGTC ATGTCAATCA CTGGGGGGCC 
64 321 ACGGGACCCG GCGAACAAAA AACGTCTTGT 
643 81 CAGCATCCTG GGAGTGGCCG GCGGGGTCCT 
64441 CGAAACGTTC ACCTGTAAAA CGCACTGCCT 
64 501 GCACCCCTCC GATTCTGAAC TAGATAGTCA 
64 561 AACTGCGGCC AGGGAGAACC GGTTGCCGCG 
64 621 GCCTTGAGGA CGAGATGTCT CTGTCCGTTT 
64 6 81 CCCCACAGCA TAAAAACGCA CCTCTGAAGC 
64 741 GTGAACCAAT CCCAGCCAAG GTTGGCGTGT 
64 8 01 ATGGTGTTTA AAAGCAGAAC TCCCCGTTTT 
64 861 GCGGTGAACC CAGGTACAGT ATTGGCTATC 
64 921 ATACTGTAGT CCGGAGCAAC GCTAAAGGCT 

64 981 TCCTGGCCAA GGATCACGAC TTTAATATCC 

65 041 ATTCTATCTA TCGGTGGATA AATCACGGTT 
65101 AGAGCCGCGA GCTTCTGTTT TAAAAATGGA 
65161 CTAAGTAGCA ACGTTTGTGT GGAGGGTTCT 
65221 ACTATTGTTT TAAGCCAACC CTCCATAAAT 
652 81 GTGAAACTTT TGTACTTCTG TCTGAGAAAT 
65341 CGCTTTCACT ACCGTTTATG TGAAAGTCTT 
654 01 ATTGCATATA ATTCT^CAAT TGAAGCTCTT 

654 61 AAATCCTTAT AAAAAACGCC CTAAGGTGAA 
65521 GATCAGCTTT ATATCGCAAT GTTGCTGTAC 

655 81 AATGAAAAGA AGCAATCTCA AACGGCTCCG 
65 641 TACAACATGG ATTTACTACA AGCCCGGTAC 
657 01 TAAATAAACT TAAGGTATAC ATACTTCTCA 
657 61 GTACTTATAT AATATGTATA GAATTATGCC 
65 821 ATAATGCAAA CATCAAATAA TATAACACAA 
65881 TTAACCCGAA AATACAGAAA TGACTACACA 

65 941 ATATGCAACA ATATTTCAGG GTAACACCAT 

66 0 01 AGCTAAGCTC CTCAAAGACA ATTTCCTCTT 
66061 TGTTAATGTC GGAAATATAT GAGTCCAGGA 
66121 GGGCTTGTAA AATATCATCC AGTGAATCAG 
66181 TTAGCCCATC CAAATGGTGA GTGGGTGACA 
66241 GCTCCACACA AACGGAAGGA TGCATTTGAT 
66301 TAACATCAGG AACCGGACCC GTCAATGACG 
663 61 CACTGGTTGA ATTGACGGTG AAAAAAAATG 
66421 ACCACACGAG GGCGGAATAA AACGGGTCCC 
66481 GGCTCATTGC CATTTCCAGA TTTGTTAAAT 
66541 ATAACCACGA CAAGTTCAAA CAGGAGGCAA 
666 01 TCACAAACCC GAACTCTCTA GGAACGCACA 
666 61 AAACCCTAGT TCTGAGCACC ATCTGGTTAT 
66721 ACTGCTGCTT GTGAGGATTC AGCTTAGGTG 
66781 GCCTGGCGGC GCTCGCGCAC CTTTGACAGC 
66841 GCTCTTTGCA GTCGCGCCAT ATCTCGGCCA 
66901 GCATATATAT CTGTTGGGTA ACAAACCTCA 
66961 TTTCTATCGT TTTCTCTGAA TCTATAGTTA 



GTCGTCGCTG TCTGAAGATG AACACAGGTC 
GGCGTCTGAT TCGTCCCATG TCTCAACAGA 
AATAATAATG GTCTCTTGTC CTGCAGTTGA 
AGACCATCCG CCCTCTGCCG GTGATCTAGA 
TGGCGAAAAG CCGTCAGCTT CTCCGCCACT 
AAAATCGTCT TCGCTGTCTG ATTCAGATGA 
CGCAACGGGC CGCGGACCAC CAAACGCGGT 
GGCACCTGTC GAAGTAATGA CGGTAGGTAG 
CCGTGCGTGC GGATCTATTC TAGGTGGGGA 
ATCACTGTAG TCGCTGTTCT CCGAATGAGA 
GGGAGGGGAA TCTTGGTCAA AAACATCCTC 
TCCGTACCCG GGCGGTACCG GCAACCCCGT 
TCTCTTTTTC CTAGGTGCCC CGGGAATCGG 
TGACGGGTCG GATAAGAACA TAGCCATGGC 
ATCCCCGATC AGTTAATATT CCAGTCAACG 
TTCGCAAGTT TAAAATGGTT GCAGCCTAGA 
TGGGCGGTAG CCAGGGGAGA CGGATGGCGC 
ATCAGCGTCG CCTTGTCTAT AGCCTTTCTT 
TTGGCAGAAA GGCAGCTTAT TATGTAGCTG 
GACCCCGCCT TCCCTCTTTC CACCGTCAGA 
GCCCAACAGT CTAAGCACCC GTGAGAAGGA 
TCTTTAAAAA TATTTTTGAG GCTTGGAGGT 
AGTCCGGTGG CTTGACCGCG ATGGTAAGGA 
TCCGGTTCGC AACAATACGA CCACCACATA 
ACGTTGCTCA TATCCATAAC GCGCTTCAGC 
GAAAGATTTA AAAATTCAAG CCATGAGTCG 
TCTAACACTT CTGGTGACAT TTTACTCCAA 
CCCCGCGTCT GGTAATTACT TTACGTATAC 
CCAACAGAGG TGGCCGCGTA TAAGCAGTTC 
CAAACTCGGC TAAAAACTCA TCAAGACTAT 
CTCTGGGGAC ACTGTAATTT AACTTGTACA 
ATCCATTTAC ACATATTTCT GTGGTAGAGC 
ACAGCCCTGG ACATAAAGCT AAATTTGTTA 
GAAATATAAA GTCACTAAGT GGTATTATTT 
AACATGTAAT TAAAAAAAAT CCACATGTAA 
TGTTTTATAC ATAATATGAA CGCTAATGCT 
AATACAGTCA TGGTAAGTAT ATTTTAAGGC 
AATGCACGCT TCCGGATATC CGCCCAGCTG 
AACACACCTG AAACCAATTT TATTCTCAAC 
GTTAATAAAA TACGCAGCAT GCACATTTTT 
CAATTGATGA CTCATCAACC TCGGTTTCCG 
TACCCTCGTC ATCGCCACAA ATTAATTCCA 
CTGCTAATGT CAAGGATGTT GTTTCTAATT 
CTTTGGGACA AGACCCCGTA AAATTGGCAC 
CGTTCACGAG GGGCAGCATA TTTATTTTTA 
CACATATGTC CATTAAGATT GGAGTCGTGA 
TATTCGTGTC ACATGCGTAA CTATTTTTTA 
CGGCGCTAAG GCCGTGAATG CGGCAGGCAC 
CTGTTCGAAC ATACGAGTCT AAAATCAAGT 
CGCGGCTGGT TTCCACCCGG ACATCCCCGC 
TCAAAGCGTT CAGACAAAAT TCCGAAACCG 
AAACTGCGGT TAAGAGACGA GCGGCCACAC 
GCCTGCAGGC TTGCTGACAG GCCCGTAGCT 
CGCACGCCAG TTCCAGAGCG AGGCAGAAGC 
TAGGAGATTC GCTCGCGTGA TCTTTTAGGT 
CTCCCTGTAA AAGGGGAATA AGGTCCGTTC 
CATCTTTAAG TATAACCAAA GACGCTAAAA 
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67021 ACCCAGGATG GTCGACACCT TTAAAAAATC GCAAAAGCGA CCGGATGGCT CTATCCGCGT 
67 081 TTCCATGTCG TTCAAAGGTA GTCACGATGG ATCTCCAGTT AGACTCTGTC TCCCTGTTAA 
67141 CACCCTTAAC TGGAATGGAA ACAGCCATGA CACCGTGAAC TTCCTGATGT CTCCTAAAAC 
67201 TAACCCCGAA ACAGAGCTAA ATACCAATGA CTGTCACCCC TACCCCAAGC CACGCCCCCG 
67261 TACTATTAGA CCAGGGTGAG TAACCACGCT ATCCTTTAAA AACCCATACG TGGAGTTTGT 
67321 AAGGTAAACA GCTCGTGTAT ATTTCAGACG CCTGTCATAA AATGGATACC GACGACAATC 
673 81 AGGTAATTAA ACTTTTTTTA TTCAAGATTC AGGAGGGCGT GTTTACAGTG GTGTAGGTGG 
67441 GAGCATATCT CGCCAATGGG AATGGCTGAT GAATCCACAC TTAGTGCTCG GATCAGGTGC 
67501 TCTGTGTACT TTACTAGTGC CGATGAACCT AAATTTGGCG TGTTTAGGTC GTACCAACCA 
67561 AAAAACTGCC TGGCCTCGCC TTTTAGAAGC TCTATGCAAG CTTTAATCGT GTCCAATAGC 
67621 TTGTCTTTAA AAATGCAACT CTGGTACATC TTTATGACAG TGGTCCAAAA AAAACAAAGA 
67681 TTTAAAAACA CGTTAAACTG CGTGTCCTGG TAGTCCTCGT ATATAACCTC CTCAACAAGA 
67741 AAAAATTTTT TAACCAAACT CGCCAGGTAC TGAAACGATG GTACGGACAG GTCGTGAAAA 
67801 GTGTCTATCA TCTCCTTCAC CTCCCCTTTG AAGGTTTTGG TTACACCGAC CATGTGAGAT 
67861 AGGCACCAGC TCAGGGGGGA GGTCGGATCG TGAGGGGGTA ACAATTCGTG GGCGTGGGGG 
67921 TACTCCAGTT TTAGGATCTC CGGCAGGACG CGTACGAGTT CTTCGTCAAA TCCTACGCAG 

67 981 CCCGCATCCA CCAGGGGCAG TACTGAGTTA ATAAGTCCCC GAATTCTATC GTTCGTAATT 

68 041 TTGTGGAGCT CCTTTAGACA GTAAACCATG TTGCTCCGAC ACTGCGGTAC GATGAACCGC 
68101 TCTCCTTGTC TGTGTTGCAT GAGCACCTGG AGTATCTTAG ACAGGTATAG GGCCCGTTCC 
68161 ACTCTAAGGG TTGCCGCTTG GTAGAGCGGA TTGCGGGCAT TGACGGTCAG TGTGTTCATT 
6 8221 TCACCGACTA TCATGCCGAG TGGCGGATAG TGGAACTTGT ACATATGATT TAAAAGATGG 
68281 TCTTTCCCGT AATGGCGACT CATGGTCGTG TTTTAACGTT CACAGGGTGT TAAAA7VACTC 
6 8341 AGAAGGTCCT GCGTGGACAA CTTTTGCGGA CTCTCCGAAC CGCTCAGGGA TAAGCTGTAC 
68401 CACGCGGTGC TTCTTATAAA CGAAGCATAC GTCAAAAAAC ATGACCCACG TGACCTAAAT 
68461 AAGCTCACAG AAGAAGTCTG TCGATGTATT GTAATGGAAT GTGCCTCCCT GGGTCCCATC 
68521 TCCGGGCTCA TCGCGGATCT GAATCTCTTT AACCTGTTCT GCCTCTACCG GGGTTCCCGG 
68581 GTTAAAACCC GCGGGGCCGC CACCTGTAAC GTCCCGTGCG CAGAATGCGC CCAGGGCATC 
68641 GTGAGAATTC TCACCGAGAG GGCCCTTTGT TGTACCGAAA AGATGTTCAT AGCGTCTGCC 
68701 TGCAGCGGGG TCGTTATACC GCCCCAGTTG GCCAGAGTTC TGCACGACGT CTACGCGGAA 
68761 ATGAAGGCCA AGTGTTTGGG GGCGTGGCGT CGTCTCATAT GTTGCAGGCG GCCTATCATG 
68821 xGCTATTGCCG ACTCGGTCCT CGTCACTTAT AATACCCTGG ATGCCGAGGG AAAACTGGAA 
68881 CTTAGACTCA AAGCACTGTG CAAACTTGTT TTTCAACCCA TCTTTCTTCA AAGAATCTTA 
68941 GCCCCTATGC AGCTACTGGC CAACGGGAAG ATGGTTCCTG ACAATTATTT TACCATCACC 
69001 GGTACGGCCG AGAAGAGGCG CCCTGTCGTG ACTGGAAGTA CTAGCGGGAT GACGTGTCCG 
69061 GGAAGCAGCC TTGTCCCCGA TTCCTTAATC CTGCCAGTAT GCGAACCGGG GCTTCTCCCG 
69121 GCACCCCTGG TTGACCTCAG TAATGTCTTA GAAAATCCAG AAAtCATCCT CAGCGCCCCA 
69181 CCCCTGAGTC AATTTGTCAT CACAAACACG CACCCCAGTC TGCCTCAGTC AGTCAGCATT 
69241 ATTACGCCAA CCCAGGGCGT TGTTCCCGGC CAATGTTTTA TGGACACGTG GAAAGCGGTG 
69301 TCACAGAGCA TTCACCACCA GGCACAGACG CCTATTTTGG CCGCCGCACT AACCGGTTCG 
69361 ACATCTGCGG CCCCTGGCCC GCATATCGCA TGTTCCCCAG TTGCCGGCAC GTCTCGGCAG 
69421 GTGGAAGGGT CCGCGGGCGT CGATTGCGGG AAACCAGCAT GCGTTCCGCA GCCCGCGTTA 
69481 CCGCCCAATG TCCCCGCCAA GAGGATGGAA ACGGTAGCAC AGTTGGGAAA CGCTCCCGTA 
69541 AAAAACGTCC ACATCGGAGG CCGCGTATAC GCTCCACTGG TTAATATACC AATAATAGAC 
69601 TTAACGTCCC CGTCAGGGTC CGGCCAGAGT CCGGCCGATA TCGCCAACAC TCCAGAGTCC 
69661 CGCATGGCGG CCGGCTCTCC GCCCTTCGCC GAAACCGCCG CAACGGTCCC CGCTAAGAGA 
69721 AAGCAGCCAC GCGAGGACGT GGCAGACAAA AGACTGAAGG GCGACGTTCG GGGCGCCGCA 
69781 ACAGTAAACC ACCCTTTCCC GGGACCGTCC GGGATGCGCG TTCGCGAGCA GGGCTTATTC 
69841 GATTTAATCG AAAGCTCCAC GGATGTAACC GCGAACGCAT CTGGACCCT^ AAACGACGAC 
69901 GACATGCTAG CGGCTATCCT ACAGGACCTG TATGGACTAC AGTCCCCCCC GGCCATCGAT 
69961 TCCCCCTCCA GCAACTCGGA CAATGAGGAG ATATTTCCAG AGGTTAGTCC GCCATCTAGC 
70021 GGCCACGGAT CGCCTTGAAA GATGACCCCA AGACGCAAGA TCACACCACC GAGACAGTCC 
70081 CACGCAACCC GGCGGATGTC TAACCGTAAA AATATCACCG GTTGGAAAGT CCCTTGGTGC 
70141 GACAAAAATG AGCTTCAAAC TTACCCATTT TCTGTGCTAT TTCGGGACAC TAAAAAATTG 
70201 TCGCCAGTGG CGACAATTTT TAAGTCTGTG TAAATGTCAC TCTTTCTCAA GTTTCTGTCC 
70261 TAGAATTTTC ACATCCAGTT TCATATCATA TAAAGACACC TGTTTTGTTT TCGTGAATCT 
70321 GGCGTGTTTT TATCTTGAGA GGCCCCCTAA AAACATGCCT CGTGTGAAAA CACAACCCAA 
70381 GAGACCCCAA GTGCTTGAAT TTATGCCATT AGATCTCCAC GGTGGAACAC ACACGGAGAT 



21 



u a ^ .X nj^ o o m u s u E u ,JL 



wo 00/28040 PCT/US99/26260 



70441 GGATTCTCAA AACCTGTGTC CTGACGGCCA GGATCTGCTC GGGTCTTATA TCTATACGGA 
70501 GAATAACGGC CCGTTTTCCC AAATAATGCA CAATGGACAG AGCAATACCG GGACAGGTGA 
7 0561 AAGCTTCGGC AGCTACGCTG CCGGCGACGG TTTTCTGGGC GGTTCTGTGT CAGGGATGTA 
7 0621 TGGAAACAAC ACCGGAGAGG GCGCGTGTTC TAAAAGACCG TCCGCGTGCC GTAAACGCTC 
7 0681 GGCTGCACTA ATTCACGCGG CGTCCGAGGC GTCTGTGGCC GAGCAAGGCA CCTCACAGGG 
7 0741 GGCACATGCC GTATCTGACC GGATAGGCAG AGACGGTGGC GCTGACAATA GACTACTCAA 
70 801 GGTGAGTGCG CGGCTGTCGG ACAAAACAAA GAGCGCCCTT CGCAGCCATC CTTGCTTGCG 
70 861 TTGCTATTCT TTGATGTTTA ACACGTAATG TCTGTTTATA TTAACCCTGC CATAGCCCGA 
7 0921 ACCGAACACT GTGCGATGCG ATTATAAAAC GCAACACTCG GACACGTCAG ATGGACGCGC 
70981 AAAACGCGCG CGGCAGTGCA AGACCTCAGG AAAAAATAAA GCCCAACAGC TGCAACAGGT 
71041 ACAATAAGCA AATTTAGTAA TTTGTATTTT TTAAGTTTCC GAGGTTTTCC CGCTGGCCTG 
71101 CGGCACCGTG TCCGTGCTTT AGTTGGAATA CCTCTCCTAA CAAGCACGTA TTCCAACAGG 
71161 AGCTGGAAAG TAAAACAACG CT^CTGACCG TCCTCACCGC TCAACACGAA GCCCTTCAAA 
71221 AGCACGCATC ATTTTTACAG AAGATGATGA TATTGATGTG CAAAAACGGG AACAGTAAAA 
712 81 AAAGCACGTC ATGATTTAAA GCCACCTCGG GTTACTAGTT TAAAACTGCC TAATTGATTC 
71341 TATTCACCCT TCAATAAAAT AAAATTTTAA TACGTTTGCG GTTTGTCCAT TTCCTGTCTT 

714 01 AAAACTATTA AACAATTCAT TTGGGAATAA GGATTTATGG GATTAAGGGA TTAAATTTTT 
71461 TTGGATCATG GGATTTGGGA ACATACGTCT GGGATGGAGG TTATGCTTCA TGGTCTGGGT 
71521 GGCGTGGATT GCACGGGGAC GGTCGGTGTG CCCAACCTGG CACCTGACAG ATGGGAAATA 

715 81 CGAGGCGGTA TACAGGCACT ACCTCGAAGA GTGCCGCAAA CATGAAGGCT CGGGGAGCCT 
71641 GGACGGTTCC GGACAGACAA AGGGGTCTGG AACCAAAGCA ACCACCGAAG CTAATATATC 

717 01 GATAAGACCT AACGTTGTCA CATCAGGTCA AAATAAAGAG CCGCCTGGGA CAGCACCGAG 
71761 GGCCGAATCA TCACACGACC TGCCACGCAT CAAGCAGGTT AACGCTCTCC GATTATCAAC 
71821 CCCGGAATTG GCGCAACCAC TCCCGGTAGT AAAATCGACT CCGCGCGAGT CACAGTCAGG 

718 81 TGGGACACCC TGGAACGCGC GCCCCCACGC GTTCATTATG CACACAAACG ACATGCTCAA 
71941 CCCATCTGTG GTCCTGTCTT TCAGAGCCAT CCGTGCGCGG TCCACACGCG ATACCGAGCA 
72001 GTCCGTTCGC GATCGGAACA CGGTCACGAC CAGCTATCGT ACCCCTGGCC GCCCTTCCCT 
72061 CTTTCAAGCC AGACCCTCGT CTCACGGTGC GCGTCTACCC CCTTCGCCCC GAACGATGGC 
72121 AAGATACGCC GAGTCGCGAA CAATATGCGA CCAAAATTGA CCGCAAAGAA TACCACGCCG 
72181 TCCGTTCGAC GCAGTGCCCG TATGGACGCT GCACGGGACC TCAAGCGTCC GTGTCAGACC 
72 241 CATTCCTCCC TTCGCCGACG CCGTGGCACC TGGTTCACCC CCTGTGGGCT ATGGCCGGAA 

723 01 TATTGGGCCT CACGATCGCC TTGTACACGA TATATCAAAT ATATGCACAT AGAAACTACA 
72 3 61 CGACGCTTCA CTGCGAATAA AGGTTTATTT ATTTTGCACA CTAGTCCGCG TCGTTATTTC 
72421 TGGTCCTGGA CTGGGCGCGC CTCCTCCTGA GCGCCCCGTC CGCGCTGGCA GCGATTCCGC 

724 81 CTCTGGTGGT CTCGTCCATT GACACGTCCA CGCGGATACT AATTCCGGCG AGGGCGTCTT 
72 541 CCATCTGCCC GCGCGTCACG GCCTTGGCCG TGGCCGCCCT CACCTTAGCT TCAATCTTAC 
72601 GGGTTGCCAC GTTACATAAG GCCGAAACGG CAGCTGCGAT TTTGGCCTCC TTTTCGGTGT 
72 6 61 TAGAGATGAC GGGGTCGGAG CCGGACCTGG TGGGATCCCC AGAATTTATC AGCTTTTTGA 
72 721 GCTGCTTGTT TTCTU^CGGAC AATTTTTGCA CCTGAGCGGC TT^CTCCTCC ATTGTCAGCT 
72 7 81 CTTTTTTGGG GGCGCGCGTC TTGGGACGCG TGGAAGACAT GGCAGCTACT ACGCATTACA 
72 841 GAGTACAGCG GGACGTTTAA ATACTTTCAG GCGGCCCCTG AGGTCACTCG TGCGCCACGG 
72 901 TTTCAACAAA TTTAAAAAAG CACATATATG TTAGGTACAG AAATGTTGCA ACCGCGACCA 

72 961 GAACAGAGTT GATCACCGCC CATATACTCG AAAAACTAGA CAGCACGGGC TGGTACGTGT 

73 021 CCGCCGAGCA ATCATAGTCA TAAAAAGAGG GCGTTACGGA GGGAGCCGCG GAGGGGTTTG 
73 081 TAGTCACAGG AGGCGTGCCC GTTCCTGTAG TCGAGGAGGT GGTGACGGTG GCGCACACCG 
7 3141 GCAGACACAG ATATAAGCAC GCCAACAGGG CTAACGCGAG CACAATTGAC CCTGTCATTT 

732 01 TCAAGCCGGC GTTCGGCAGG CAACCCTCCA CTTTGGCACG CGCGCTCTCC TTATATACTA 
73 2 61 CCAGTTAAAC ATTATGGCAG AAGTGACCGC CCATACGGTT CCGTACGCCT TTGACTCCTG 

733 21 TAAGTTTGAA ATAATACCAA AAAACAACTC ATCTCGTATA GCGTTACGGA ACAAATTTCC 
73 3 81 CGTCGTGGTC AAACCGGGAG AACCTCTAGT CGTGCCCCTG GGACTAAAAA TTATCCGCGC 
73441 GCCCCAGTGC GCATTCTTTC TCAGCGGAGC GCCGACGGAT GAGGTGTATT ACCACACCGG 
73 501 ACTTATAGAT CAAGGGTATC GTGGAGAAAT AAAACTCATC GTTCTCAATA AGACTAAACA 
735 61 GGTCGTCACA CTTTACCGGG GAGAAGTCAA CGTCTCACTC ATCGCGTTTA TGTACGCCTC 
73621 GCCCGGGCCC CTAAAATGCC CGATACTGAA CCTACCACAT TACTCTCTGG ACGCTGGATT 
73681 TGATGTTACG TCGCCACATG CCATGACCAT ACCCCCAACC GACAGAACCC CGTTTACGTT 
73 741 GAGTTTGTAT TATAAGAGTC CACAGCTCAG CACCCCCCAC GTCCCACTAA TCGTGGGGCG 
73 8 01 CTCTGGACTG GCGACAAAGG GGCTTACCGT CGACGCGACC AAATGGACCC AGTCGTTGGT 
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73 861 ACATCTGAGG TTTTATAACT TTACCAAAGA GCCAATAGAT ATACCAGCAA ACAGCCGCAT 

739" Stctcaggta gtgtttatcc acgaagacca cgtcccaagc gggtggaaca ttcttagatc 
7^98^ cJScgtacaa ctcggcagca ccctccagat atcgtgggcc aaaatcaggt ttaccgacgt 

74o!i GGCCACGCTC CCCAAAACCC ACCCGCTCAA CTCCCGCCAC ACTCAAAGCC AAACCGAACC 
7I1OI GGAGACCGCC CGCGGCGCAA AGGGGTTGGG GTCGTCAGGG TTATAATTTT AAAACATGGC 
7! 161 ATTATTTTAA AGTTGTTTAT TTTTAATAAA ACAATCCAAA TTTACGTTAA CCATCAGCTT 
74221 GCGCATCCTG TATTATTTTG TCTGGCACGA TATTATCCGC CACCGCCAAC TGGGGCGTCG 
itlel GTTGTCGAAT CACCGGATGG GAGGGGTTGG AGGACGCTTC GGTGTCTACC AGCACATTAT 
74341 TAATTTCCGC AGCCCACGTG ACAGAATCAT CGGACGCGAT CGTCGTAGCC AGTTTTTCCA 
74401 TTTCCAGGAG TGACACGGTC TCGTCGGCCC GTTTATTCAA AAGCGCGTGA AAGGCGCTCT 
74461 CGTCCACCGT AATATCCCCT ATAGCCAGGG ACAACATGGC GTTCCAGATA ACGTTCTGTA 
74521 TAGACTTTAG CTTAGATATT TGAATCCTAA CCGTCTCCTG GCACCCCGCG GGTACCCTAG 
74581 cJSgtcaaa ACGACGAAGA TACTCGGTTA TTTTATTGGA TTGAACGGCC AACAGATACG 
746!i TCTGCCGAAG GGCGGCGGTG CACATCTCTG CCTGCTTCAC ATCTCCCATA TCCACTCCCG 
7I7OI GGGGCAGACC CATGGCTATT TCGGTCCGCC AGTGAGTAGG ACACTCGTTA AGAGGATCTG 
7^76^ gSgggScg ATATTGACAT CCCACGCGTC CAAACGGACT AAACCCGCAA CACGTCCAAA 
7482! aSScSgg CGCGGCCATT GCCAAACAAA ATGGTAGATG aaattagggc aattttctct 
7418^ icTAGTGGAG ATATGGCCGA AGTAATTACG GATATACTGA CTGAAACGCA AGCAACGGCG 

illll ?c???Stct gcgtgctcca cgatcggggc gacgcgccta taaatactcc acatgccgta 
750ol ISSStS gcctgcccgc caagcgccca ggcggcgggc caaggtgttt accgttgatg 
7^06^ SSJtgSacc taccggcgtg gcaggttaat ctattcctaa caggtgacgc accattgacc 

Vslll ?SS?^CA TTAAAGACCG CATTGACCTG GCTCAGACCG AGGAAATACT CGAACCCATA 
7^18^ ??SS?S?AC TGGCATGCAA ACGGTCCGCG CAGCAGACCA AACATGACTC GTTTAAATCT 
islll SSStcgSt GGTTCAGGGG AAAGTTCGTC TCGGCCCTAA GAAAAGTGTA caaaatgacc 
• lllol CCATCCCCCT ATTGGATGAT AACGCTGCTG GGCTCGTTCG AGGCATCGTT TGTACTGGCC 
7^361 ggScg?ttt ATTTTTTCCA GTCTCATATA TGCACGGCGG AGACGCTGGT GCATCTAACG 
mil agStattca GCTCCAGTCA GGGCCAGAGC TTGGTCACCG taaacaccta tgacgaatta 
islel SggSJSat tcgggcggtc agatttcctt ggaattgttc caaatttttg ggcgtatcta 
7^5!^ SSScS^ tgcaacagga cgacgtggag tccagggcca tcgatcaaac aatcaactcc 
7^60^ 5?JcotSggg ggttgatgct gtcacctcag gacctcgtac acttcatcta cctgtcattt 
illll ?Icgagtgca tgaacgcaca gacgttcctg tcgtactctc gtaccacgtc cagtttaccg 
7512I Ic?c?CGCAA cggttaaccc tccgcagctg tgtcggcgct tagaagcgga ttttaaggag 

7578^ JJcGTGATGG CATATTACAA TAAAGCAAGC TACCTGAGTA CTTACATAAC CATTTTAACC 
lllA S?ACCCGCTC CTCTCCCGGA CGGATACGAA AACTXTCAGG AACTAGCATG TCAATACTGG 
7^9^^ ?SSga?S? CGAGAGACGT GGCGGAAATC ATGACTAGAA TTAATGACCA ctatccacag 
lllll CTGAATCTAA CAAAAGATTT ATCCGGTCTC CTTGACCTAG CGGCCCTAGA TCAGTATTCC 
7602^ SgSSJccJa AGGAAAACCT TTTTACGGTG GCTTCGAGAA TTCCCACATA CAGGTGCGAG 

7608^ ?????gSS aacagtattt tgttctaatg cacgcagact gtatagacgc gtactggaaa 
l All ?SS?J??A tcgtgcccga agacgcccaa ttgcaaggcc tgacggatca agatctcacc 
762oi 5?^Stat tttactgcga cctcggccta tctctaccta catttaaaca acaaattcta 

7626^ SS?ScG?J ACGAGTATTT CAACCCACGA CTCCCAGTTT ATAGATGGGT GTTGGATTTT 
76321 g™JSgg TAACCGAAGG TAGACGGACT CTAAACGATA TCTACAACAT ATGCGTAACG 

7638^ ™g5Sgg tcatattaga aacgttgcag ttaataggtc cactaaaacc gaaccacccc 
764!^ SgtStttt ttaagtcggc ctgtccagcc gtaacctggc cggatgatat ctcagacacc 
76501 gSSS^ actgtgacgc aaaaataggg atgagaattg taaccccgtt tccgagcgga 

7656^ ?AcSS?GG TGGGGTCCGC TCCACTCGTG TCCCTGACCG ACATTTTAAA CCGCGTGGTT 
76621 AAACTGGACA CGCGGCTAGC CTCAGAGTAT cctgggatct tagaagataa aggaccattt 
76681 SS?5Sggca tttacgccaa ggggcgatgc gtgcgcgtac ctcactgcta taaggttggc 
iBill ccggSggcg agctgtcacg gctcctaaaa attattatct gtcaccccga agagtcagat 
illol S^?SS?GT atttgaaaaa cgcgtttaag gtttctaatc tgctacatca cgcccctgga 

768°! S^C?G?CA CCAAAAACGG CCACCTGGTG TACGCGATCA CTGACGAAAA CGAGGGCTTT 

769" SSS^GCA AAACCAAGAA TAATCTTCCC AAAACAATCA CGGACCTGGC CGAAAAAATT 

76181 GAG?^CCA CAGAAAAACC ACTAATTGAT TGGGCAGCGA CCGCAGTGTG GCCAAAACTA 

7"!^ ScGaScCA TACAGCGrXT CTTTCCGGAT ^ACCGCATCG OCCAATTTGC JJJTGTGAGC 

77101 TTCATGCACT CCGGAGACAA CATTATACAA GTTAAACCAC AAAAAGGAAA CAACTTTTTC 

7716^ ?gSSS?c ATAAGCACCG AAACCACACT CAGACAGTCC GTGTATTTTT AACCCTACAT 

772" tcSSSIg AAAGCGAGGT CACGGTGACC TTTATGAGTC AGTGCTTTGC TGCAAAATGT 
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77 2 81 AATCATAATA GCCCAACTGC ACATTTTTCA TTTATGGTAC CCATAACCGG CACGTAAGAG 
7 7341 ACACGAATAA AATCATCCGT GATGGGCCAG CTCTCCGGCT ACTTTGTTAA TTTTAACACC 
7 7401 GCGTAACCCC AAAAACAACA TCTGGACGAG TTTTGTTCGC GAACGTGAAC ATGGCACAGG 
7 74 61 CAATGGTGAG CCTGGAATAC ATGAAGGATA TTTTGGACGG TAAGAAGACC feCTTGTGGCT 
77 521 CTTACGACTC CCATTTAAAA CCGCAACTTA TCAAACGGCT CTTTTTATAC GATTTACATG 
775 81 CGGTATGTGT TTCACGCACT AATTTGTTTT ATAGGGGGCA TATCATCGTC GGACTTCGAC 
77 641 GACTCGTCCT CGGACGAAAT GGACGACCTG TCCCCAACGC CGGAGCCAGA ACCGTCCACA 
7 7 701 ACGCCGAACA GCTTTCCGGA GGGGCCCAAA TCACAAGTGG TGGCGTTACC TAAAATCCGT 
7 7 761 AAAAGATCTC GATCTGAGAC GCCGGTAAAA ATTGAGCACA GATCTCCACT TAACCGCTCG 
77 821 CGATCTCGAT CCAGAACGCG GTCCGGGTCC GGTCAACGAT CAAAGCAGTC AGGCAGATAC 
77 881 GTCAAGAGAT. TCAAACCAAC GGTTGATGCA CCCCGTCATC GAGAACCGTG GCACAGGGGC 

77 941 GGTAAGGGAA AGGCCCCGTT TATCCGCAGA GACGCAATGG CTGGCCGCGG TCGACGCACA 
7 8 001 TACGGCCACG ACTATCGCGG AAAAGCCGCT TTAACGCGGA GCATTAAAGA GTCTATTAAA 
7 8 061 AAGATGCACC TTCCATCCAC CATGCTCTCT CGTGCGCACG ATAAAAAGGT ATTCGAGGGA 
7 8121 CTGTTGCCAC GACACCTGGG ACAGTGCTTT CAGGTGTGCC TGCCGGCGCC GCCACCGCTG 
7 8181 CAACCCGAGG TGTTCACGGA TCGACAGCTT ACCGCTATAG TCAAGTCTGG CGGGCGCAGA 
7 8241 GACGCGCTGG TGGCCAAAAA AGTTAGCCTG GCTAAACTAA CAAGCCTATA CAAACCCCTG 
7 83 01 CTCACGTTTG TGACGGGGAG AAACAACCAG GCCCACTGGT TGGCGACGCG CAAAAACACG 
7 8361 CTAGCGTCTG CGGGACTCGA GGCTCTCGCG GCCTTTATCG AGGAAGGTCT GGCGTGGGCC 
7 8421 CAGGTGTGCG^ TATCTCAAAA CAGATCGTTG T^CGACAGCA ACCTAGATAT CATTCTAGAC 
7 8481 AGTAGTCAGA GCGTGTGCAC CTGGTTCATC TCCAAAATCA GGCATTTGCA CATACAGTGC 
7 8 541 TTCTTGGAGA ATCAGGGTGA GGTTAGCCTG GTGAAGCAGC TAACTTACCT CGTGTGTATA 
7 86 01 AATAACCGCC TGGCGGAGGC GGCGAACCTG GCGGGAGAGG TGAAACTGAA TTTTAAACTC 
7 8661 GGAATGCTGA TTGGGTTCGC CCTAACCTTG CCGGCGCTCC TTGCAGAGCA TAAGTTGTCT 
78721 GGCGAGAGCC TGTACCTGTT TAGGTCTTTC CTGGAGAAAT ATAGACCGGG AGATGTCATG 
7 8781 GGACTCTTAA ATTCAATAGT GGTGGAGCAC TACACCAAAT GTCGTAGCGC GGAGTGTGTC 

78 841 ATTACAACCC ACGCCATGGT AGGATCTGGC GAAAACAACA AGGGACTTTT TTTCTTTCCA 
7 8 901 GTGTAATATA TAACCCATGT GTGTAAGTJ^G TTACGGTATA TTATTCACGG GCGTTTAAAT 
7 8 961 GCAATAACCC ACATAACAAA ATAAAAATGT GTAAAACCAA ACACGCGTCA AGCTTTTTCT 
7 9021 GCAAGGGTTC TGTGCTCGGA ACAAATAGGC AGAGTTCGCC ACCTAGCGGC ACGTTCTCTA 
79081 TAGCCTTGGT ATACAGCGGA TGTTATTTAC GGAAGTGCCA CATGAAACCG TGGCCGGGTG 
7 9141 CGTAACAGTA AACCGGGTCC AATCGCGCAG TTTACACAAC ACATAAAATA GGCGACCGTT 
7 9201 TATAGCGGTT TATTTGAACA TACAACACAC ATTTCAGGAG TTTCGGTCTT TATAAAAATC 
7 9261 TTCATTCATT CAAAGTGCCG ATATATTTCA CGCTGGTGGA CAACAATACA TATCGGCGCC 
7 9321 TGTGATTGAT CCTCCTCCTC GTATAGGGAC CAGGCAGACA CCCAAATCTG TGCAAACGGA 
79381 TCGGGTGGCC CGGGATCAGA GGGTCGTGGT GACTGGGCTA ACTTTAACAT GTATTCATCG 
7 9441 GTATCAAAAA CGCGATGTCC CACCCCAGAT GGCACGCGTC TTGGTTCAGA GGTGTTTGAG 
7 9501 GTATTGCCAC GGAACCATAA ATTAAGAAAT CCATAGCCAT TCATCATAAT CCCATACTGA 
7 9561 CAACTGCACA AAACAATGCC GTCCTCCAGG GCGTGTJ\ACG CGTTCACAAG TCTGCTCAAC 
79621 ATAAAATCGT CATCCATATA AGGCGCCGGG GGCAACCACA ACTGAAGCGG CCCGGGAGCC 
7 9681 ATGCAGACGT GGTGTTCATT TTTATCCGTT GGTCTCGTGG ATAAACGAAC GCCCATCCCA 
7 9741 GAATGGGTAA AAGTCTGTGC TTGAAGGACC CCATAATAAT AAACCTGAAT ATGCATAAAT 
7 9801 CCCCACGGGT CGTCAACAGA TGGCGCCAAA CACGCAGAAA GACCGCTCGA TGCAGATAGC 
7 98 61 GGCAGGGCGT GTTCCCCGGG CAACGCAACC GCGCCCCCTT GACTATTGTC CTGCCTACCG 
7 9921 TCTTCACCGG AGGCGGTTGC GGTTGCAGAC GCCTCGGTGG CCGGGGCGGT CATGGTGAAG 

7 9 981 GTACCGGCGG GACGGGTACG AAGTAGCCCG AGCAGCTGCT TTACCGTGTA TTCAGATACC 

8 0041 CTGCCCGCCT CGTCGGCGCC CTTGCGCTTA AACATCCGAA ACCTCAAGCC CCGCAGTTCT 
80101 GCCGCTACCC CAGAGGCCCG AGGGCACAAC CTGCATAACA TCTCTTCATC GGAGCGCAAC 
80161 CTGAAGACCA CGTTTGCAAA TGAAAAACTC CGTCTCAAAA CGTCCTTTTC TTCTATGAAA 
80221 TAGGCACTCT TTCTTATCGC ACCCAATAGG CGCGCCTTGG CCTGGCCCAG CCTAGACCTC 
80281 CCGGACGGAA GGGGGATGTT TCTCTGGTTG CAGTAGTCCT CGTACATCTT CCCATCCCGA 
80341 AGGTAATCAA ATCCCACAGA CCACGGGGTC GTAGTCGCCA GCACGACCCT GGTTTTTTCG 
80401 TCGTCTGCCC AAAACAGTCC CGAATACGTT GCAAGATTTG CGTGAGTAAC GATCCATTCT 
80461 CGAAGTCCCA TGGCGGACGG ACCACCGCTC TGAGGTGGGC GCCAGGTAGC CATGGCTGTA 
80521 TATGCCAGGT TGATACTCAC CGCTTTCAGT TCGTGTTAAA ATAACCAAAC TCCGCCCATG 
80581 CGGCTACCGC CGTGGGCGCA GATGTGTGCA GCCGTCACAA AAGAATCCCA CCTGCGGCAA 
8 0641 CTAATTCCCT CAGCGCCCAG TCAGCTGGCT ACCGGTTCAT TTCACTCACG AGCACGCCAC 
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80701 CCCATCATCT TCACCGCTCG TTCCCGCGTC CGGATCGTCC ATTGAATCTG GTCCGGAGTC 
8 0761 GCCCAGCTCT ACTGCCGGGT CATGCGATTG GCTCGTCCCC TCTCCTGGAT CTTCCATCTG 
8 0821 GAATGAACAG TCGGTGCGAC CCTCTGATGT TACAGTCGTC CCGGGGTAAA AACAACCCTC 
8 0881 TGTACCGCAT ACAGACGAGG TTTCGCACAC GTAACGCAGT CTCATGGCGA TGGGCGCATC 
8 0941 GACGGGATTC GGCGACTGGA CCGACAGACA CAGGGCGATC TGTGCATAAT CTCTAGGTGG 
81001 TTCGTCTCCA GGGTTCGGGG ACCTTGCAAG AGCTCTAAGA TAATCCACCA TATCAAAAAC 
81061 CTGAACGGGT TGCCTGGATG GCAGGTCCTT ATACTCCTCC TCATTGTTAG TAAGCACCTT 
81121 CAl^-SCrCCCGG TTCTCCAGGT TCTTCACGCA AATGCCAGTA TCCGTAAGGG TAATCAACAC 
81181 ACCCTTCTCG CACTTTTTGT GCATAACCCT TAGATCTTCT CTGGCAAACT GACAGTCAAC 
81241 GACCCCCGGT GACGGTAAAA ACAAAGTCCT CTCATCTTGG AAACAACACG CGTGGCCAGC 
813 01 GAGGACGGGA CATGGTCGAC TGCAGACCCG TATTCCAGCA TGAACACGTT CGAGGCCAAC 

813 61 GGTCTCTCCA AAATATAGAA ACGTTAATTG GATGGCCCCT GGCAGCCGAT GCTCGCGCAC 

814 21 ACGCTTGTTT TTGTCCATCC GACTCCGAGC CAATACCGTT GCGGTAAAAA CCCGTCTTCT 
814 81 TTCACGTCCA GACCCACCCC CCTGGTCGTA TTGGAACTCA CTGATTAAAC CJ^GGCACTG 
81541 AGTTTCGGTG GAAGACATCA GATTACACAT TGCACAACTT ACCACCGGTT CCGGCAACAG 
81601 GCGAAACAGT CTATATCGAG CACCCGGTCG CCCATAGGCC TTATTAATAA ATTTAAGCTC 
81661 TCGTATACTT TTGTGGTGTC TCATGTCGTA CAAGAATCTC ATTTTCCTGA ATCTGCCATA 
81721 GTGGGACGGT TTTTCGCCCT GACATATGCC TCGCATGTTG CAGTAGTCAT CAAAAAACTT 
817 81 CTCGCTATTC CATACGGACC GATCTGCAGT TACTTTGTTC CACGGGACCC GGACAACGGT 
81841 CTTGTCCTCA TTCTCCCAAA CCAATCCGCG GTACTCGCCC GATTCGACCG CGGCAACGAG 
81901 CCAGGCCCTG ATATCGACTC CACGGCCCGC CATTCCGCGT ATTGGGAGCT TGACGGATCA 
81961 GCTCTCAAGA CAAATGAAAC CCTTGCGTTC ACGGCACCAG TTCATAAATG TATCTAAATG 
82 021 ACGGAGTCAT GTGCCCTAAA CATATGGCTA AGACTACTCA AACTGTGCTT AGGGGAAGAG 
82 081 TGACAAACAC CTGGCACTCA ATAAAATTTT GAGGTCGGGC ATTGACCCGT GCCTTCCTCT 
82141 GGTTAGTATT AACCCCATCG CCACCATATT CAAGGAACAA AACACTCCAG ATACAAAACA 
822 01 CTTTATTACA GATAGAAGGC GCTCACCGGC CTCACAAAAC CGGTCGCGCG CACAGGTAAC 

822 61 ATTAAACATT AGCGGCGGAC AACGCCCGCC ACAGACACTC GTGCCACAGC TGAATAATTA 
82321 GCGGTACAGT GCTTTGCACT CCGCCCTCGG GTGTAGGTAC GCCCCCAAAA TATAGGTAAA 

823 81 CACAAGCTTG TGGAAGCGCC GCTCCGTCCG GAGATGTTCT GGCCATGGCT GTCATATAGT ' 
82441 CAACCAAGTT AAATGCACGA CACGCGTTGG CGCACGACAG GACCGCAACC GCCCCCGTAG 
82501 ATTCGACGTT CCCTTGAACG AACGCCAGGG AATGCCCCAA CCATCTAATG TAAATGCCGG 
825 61 AGGGGTCGGC CCAAAACGCC AACCCGCGCA AAAGATCTTT ATCCAGGAAG CGCAACGCCT 
82621 CGGCCTGAAA CTCTGCTAAG TGGCCTGGAA TCTCCGGAAC CAGTGCCTGC TCGGCGATCG 

82 6 81 GTGCGGCGCA TATGTGCCCT TGAGGGCGAC GTTCGGACAA TGGGAGCACA CGGATGCCGT 
82741 TGGACGATTC AGTCACCACT TCACCAACTC TGTCCCCATA ATAGTAAAAC ATTATACGTA 
82 8 01 GCATCCAACA CTGTTCACCC CCGGCTCCCA CCGCACCAGT GTACCGAACA CCGGCGCCAA 
82 861 CCGGACCAAA CTTATTCACG GCCTCCTCGC GCAACCCCCG TAACATAGCA GTAGTGCCGG 

82 921 AGGCCTGATC GCAACGAGCA CACGACCTCA CGGTAATTGG TAAAAGGCGA ATGATTCGGC 
82981 ATCTGTCCGG ATATGGAGTC GAAAGGTTGT CCTTAGTTGA CACGTCCGAC ACGGTTTGAC 

83 041 TATGCCGTAT GGCGCTGGAT AGCCAGTTTT TGCATTCCTT AGCGGTTAAT TCCCTGCCAG 
83101 CCGCGTGCAA GATGCCCCTG GATCGACAGT AGTCAATAAA AATGTTTTTT TCCGCCTCCT 
83161 CGACTCCTCT GCTCCCTCTG TCGTGATTCC ACGGTATGCG GATTAAAGTT CTTTCCTCGT 
83221 CAACCCACCG CATTCCGGGA TGACGACCTG TCTCGCAGCA TTCCACCAGC CATCCTCGAA 
832 81 GGCTGGATGG TTTGGTCACT CGGACCGGCC GCTCCATATC CTCGCCGCGT GCACGGCAAT 
83341 GCCAAACCTT GTCCAAGTTA CCTCCTGACA AGCACCTGCG ACCCTCACAG ACCTACGCGT 
83401 GGCAAACGGG AACTTGTGGT CTAGTCTGCT GGGACAGTAC CTTAGCGTTA TTTTTATCTG 
83461 TCGTGCACTC CTGCGGCGGA AGTTTGTGGT TTATGAGCAC CACCGGGAAG GATTCATGTG 
83521 TCCCTGCACC AGCTCGTCCT GACAGACCAC CACAGAAATA GGAACTTGTG GGCTCTCTTG 
83581 GGCACCAGGC TGCTGTGAGA- GATACAGTGT TGCCCCGTTG CACGGCGCAT GGCTCTCTGG 
83 641 TCCCGGAGAG CGGGCAAGCT TTCTCATGAA ACCCATGAAG TTAAAAATTT TCTGTTTAGA 
83701 GAGGAACATG ACCTCTCGTT GGATAATATC ACCATTACAG TCGTTACCCG CCGCGTACAG 
83761 CGTTTCCCTG TTCCTATTAT CGACGTACAC TCCGGTTTCG TTGCCGTGTA CGTAGATGCC 
83 821 CTCGTTATAC GCGACCAGCA TCGTTTTTAT TTGCGTTATC TGTTCTGCGG TCAGTTTTTC 
83 881 ACGATCTGGA GATGGAAACC AGACAGTGAA CCCGGTGCCA TAGAAACACA GATGACCGGG 

83 941 ATGGGGAACC GGGCGCGGAT GCAAACGAAC ACCTCTCCGA TCTTGGGACG TGTGATCCAG 

84 001 TGCCAGAACG CCAAAATAAA ACACCTGAAT TCTCAACCGC CAAAACGGCA CAATTTGATG 
84 061 AGGCAGCGGC GCTTCTGGGA (ZCOCGJkG^CQ ATGTGCTCGG GCGTACGCTG ATCTCGTCGG 
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84121 AGGGACCAAC GTGGGCAAGG GGGCGGGCAG 
84181 CACCTGCTCC AAATAATTTA TAATCCCTTC 
84 241 TCCAGAATCC ATCAAAGGTT TTACCCGGAA 
84301 GACCCCATGC TCCACTTCTT GAAATCCAGC 
84 3 61 CTTACACTCT AGCAGGTTTC CATCTAGCCC 
844 21 GACAACGGCT GGAGTTCCAG CTGCCCCCTT 
844 81 TCATCGCACC AAGTTAGGTC TGAATAGGTT 
84 541 CGTAGATGGC TGTAGTTAGA ATCTCGATTC 
84 601 CGCGATACGA GTGTACTTTT CCCCTGATGA 
84 661 GCACCTGGCT AAGCAAGCAG CTCTATTTCT 
84 721 TTCTGGTCTC ACGTTTTGCA GCTCACCTGC 
84 7 81 TCCTCCAGTG CCTTGACAAA CACTGGCCCA 
84 841 TTCGATTTCA GTCACCGGCA GTGAGTGGTG 
84 901 AGCTGTGCAC TCAAGCCCGA CCAATATGGC 

84 961 ACAGAAAAAA CGAAACCACA CGCGAGGTGA 

85 021 CGGCGAATCC ATCTAACAAA ACAGCAAATG 
8 5081 GAT7VAACAAT AATAGATAGT GGAGCCCTCG 
8 5141 CCGAAACCCA GAGTTTAACA T^GGCCGCGG 
852 01 TGGCCATATC AAGCAAATAA CGATCAGTGT 
85261 CACGATGTGG CAAACTGTAC GGTTCCACGG 
85321 CGGTAATCGT GAACATAATT CCCCTGCTGG 
8 53 81 ATTTCAAAGC GTCCTCGAGT CTCTGCACCA 
85441 GAAACCACGC CTGAAGGGGT CCATCCAAAA 
85501 GTTTTGAAGA TAGCCTGATT CCCTGCCCGG 
85561 CATAGTAATA AATCTTTATA TGCATGTGAC 
85621 ACGCCATAAG TCTGGCGCGC AGAAGGCCCG 
8 5681 CGTGTCCATC TTGATGACCT TCCCGATACC 
85741 AAAGGGCGTG AAGCATTCCG CGCCCGAGCA 
85801 GGAGCGCGCA CACGCGCGGA CACACCGGAC 

85 8 61 AGACCAGCGC CGTGAACGGA AATTGATCGG 
8 5921 TGCTCTTTCT TACGGCTCCC AGCAGGCGCC 
8 5 981 ACGGCAGTGG GATGTGTCTT AACTCACAGT 
8 6 041 AATCATAATT GGGAAACGAC GGTGGTGTAG 
8 6101 CTGCCCAAAA CAGGCCCGGG TACAGACCAC 
8 6161 GGGCCATGCC ATAACGAGAT GGGCCCCTTC 
8 6221 CAACGTTTAA AACCCATGTA AAGTTTCAGT 
8 6281 CTGTGACAAA CATAAAGAAA GTGGCACACG 
8 6341 CCCGCCGGTA ACATTTACTT CCTCCCATAC 
86401 GTTGTTCCCG AAGCACCGAC ACCCGCTCCA 
86461 GCGGCACCGT TGCTGACATC ATCTGACACA 
8 6521 GCAATGGGAG CATCCCATGT ATTTTCGGAC 
8 6581 CAGTCACGCG GCGGGACATC TCCTGGAGTT 
86641 AAGAGATCGA ACGCCTGGAG CGGCTGCGAA 
8 6701 TTGGTCAATG CCGTCATGTT GCGTTCTTCC 
8 6761 ATAACACAGA TCAAACCTCT CTCGCATGTT 

86 821 AACTCGCAGG GAATCAACTG GGACGACGGC 
868 81 CACAGGTGTC CGGGACGTTT GGGATCAGGA 
86 941 CGCAAAATTT GAACTCCGAC GTTCTCTCCA 
87001 AGCACAGGTT GCGCAGAAGA GTCCTGGAGT 
87061 CTCAGTGGCG CCTTAGCCCT CGGCCGCCGT 
87121 CTGCCCATGG AACAGCTATG CAACGATGTG 
87181 ACCGTAACCT CAGGAAGCAA TTGAAAGAGT 
87241 TTGCTCGGCG GTGTCAGCTC CCTCAAATAC 
87301 ATTTTCTTAA ATTTCTGGGC ATGTGAACCG 
87361 TCGTCAAATA TCTTGTTATA ATCTTCATCG 
87421 ACACGTATAA TGGTCTTGTC CTCGTCATCC 
87481 ACGGCCTCGA TGAACCAAGC TTTTAAATCC 



CGGCATTCCC GGCTCTAACC CGATGCCACA 
AGAGTTCTCA TCAAGGATAC ACGCATAGCA 
GGCGGTCCAG CCGCCACCGT GCGTCCGCAC 
ATTCTCCCTC AAACCCCTCA AAAATCGTCT 
AACCTGCAGA CCTCTCTCCA CGCAGTAGGC 
CTTCCAAGAC AGCTTAAAGC TTTTTTTCCC 
TTCTCGTTGA GATGGTAAAG GGTCCACGAT 
GCCATGACAG CGTACACCTA TACAATAAGG 
TATTTGCAGG GCTCTATCTC CTCAAAACGT 
GCCACTTCCT CATTTATAAA ACCATATTGT 
AACGACACGG TGAATGATCC CGAGTTTCGC 
GAGTTAAAAA GTGAAGCAAA AGGCATAGCT 
GACATTAGAA AGTGTTTGGC CCACACATTC 
ACGTGAC'CAT TCCCCTATCA CAACACTAGT 
CATTTTCCAG GTTAGAGAAA TTTATTAAGC 
TTTATTCAAA GTGCCTATAG ATTTCTGTTT 
AAGAGTTTCG CTCCTCGCCC AGGGAACAAC 
GGGGACCCGT GTCGCGCGGG GAGGGGGACC 
CAAATGCCCA CACGTGCATC CCGGTGTAGG 
TATTTCCCTG AAACCACACG TTAGGACCCC 
TGCTGCAGAA AATTATTCCA TCCACGAGCC 
CAGACGACTC AGTCGTCTGT GGTATCGGAG 
CACACACGTG GTGTCCCGCC TTATTTGTTG 
TGGCTGTCAA CAGCTCAGCC TGCAATTGCC 
CCCATGGATC GCCGGCAGAT GGCGCAGCAC 
AAGAATACGC TGCATCGTCG TCCTGCGCGT 
GTCTCTCTTG ATCACTTACG GACGGTCCCG 
TGGCAAACCG CATATTGCGC AACTCAAGGC 
AAGACATCTC TTCGGATGAA CGCAACCTAA 
TAGGGAATTC CTTATCTTCC TCAAAGTACT 
CCCTGGCTTG ACACAGCCTG CTCCTTCCAG 
AGGCATCATA GTGTTGACCG TCGCGTTGAT 
GTGCCAACAC CAGCCTCGTC TTTTGCTCGT 
TATCGGCTTT GAACGTCAGC CATTCCTTGA 
TCGATTCCCC AGCGGCCATG ATTCAGTCGG 
TCAATTTAAA ATGACACAAC TCCGCCCATG 
TGTGCGCTTT GGCCGTCCTG TCAGATGAAC 
GGTGCGAGGG TAGATACAGA TGTGCTTCCC 
GACGGTCCCT CGTCCCCGCT GTCATCTCCC 
TCATCATTGC ACACGTAGCG AAGTTTCAGG 
TGAGTAGACG GGCACATGAA AATCCACGCA 
GGCGATCTCG CCAGCTCCCT CAAATAATGC 
GGCCGCAGCT CATAGTAATT TTCAGAATAG 
AGGTTTTTCA CACAGATTCC AGACTCCGTC 
TTCTGCAGGG CCTGCAAATC TGCCCTGGCA 
AAGAAACACG TCAATGGATT ATCGGCACAA 
AGGCCACATA TACGCACACC CGAGCCGGCT 
AAGTAAAAAA AGGATACCTC AACAGCCCCT 
CGTTTGGATT TCCGTTTATA AACGGAGACT 
GTCCTCTCGA AAACATCCTC TCGAACGCAA 
GTCGCGATGA GGTTACACAG ATCGCAGCCC 
CTGTATCTTT CCCCAGAGAC GCCCCCGGCC 
CTGTGAGATC TCACTGCATA TAACATCCTA 
GTTTGACACA CGCCACGCGC TGAGCAAAAA 
ACGCGACTGT CGGTGCACCG GTTCCACGGG 
CATTCCACTC CGGGGTATCG CTTAGACTCC 
ATATCGCGTT CCGCCATCTC GCGTATTCAA 
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87 541 TACCTCTGAC CGAGCGGCTC AGTTAAAATA CCGAGCCCCC AGAACCCCAG GCCTCCGTAA 
87601 CAAAAAGGAT CTAGATCTGA AAGAAGCCTA ACTATATACG CCCACGGGAA AACTCTGTAA 
87 661 CT^TGAAAA AAATGTTCTC AGAAAACGAA GAGAGGAAAA CTGACGACAC GCAATCTAGT 
87721 CAAACCAGAA CCTAAAGGCC GTCACTGACC CATATCTTTC CCCTTCCCGA GCGAGTGCGG 
877 81 TTTCTCACAT GCCGTGAGCA AACGAAAACG CTACACATAA TAAGACACGT GTGAGAGGAA 
87 841 AACTTTATTG CAGGGACAGG GCAAAAGCAA GCTGTGCACG GTAACAGTAT GTGTCACTGG 
87 901 GCCGCATCCC CGCACGCCCG CCACAGACAC TCGTGCCAAA TTTGAATGGA CAGGGGGATT 
87961 GTGGGACACG GCTCGCCTTC CGGAGTGGGA ACGCCTCCTA GGTGGAGAGA CGCATAGGCG 
88021 GCTGGAAATC GCGTTCCGTG TGGCGGGGTG GCTGACACCG CACTCAGATA TTGGGCGAGC 
8 8081 GAAAACACGC GCAGAACTCG CGCGTTGTTA ATTATATCCA CCGGGTCCAG CGAACCTCCG 
88141 TTACCCAGCA GGAAGGCTGG AACGTGACCC ATATACCGAA TGTAAATCCC AGACCCGTCT 
882 01 GCGTAAATCA TCAACCCGCG TATGATCTCC TTCCCCAGCC ACTTTAGCAT GCTCGTCTGG 
882 61 TGAGGAAACA TATCCTGTGG GGTATGCGGC AACAGGGCCT GTTCCGGCCG AATGCCCGCA 
88321 CAAACGTGTC CTTGCAGTGG TCGCTTTGGC GCCGGCAACA GGCGAACGCC GTTGGGCGAC 
8 8381 CCGACCTCCA TGTTCCCAAC CAACCGCCCA TAGTAATAAA CAACCAGCCG TAGCATCCAA 
8 8441 CACTGCTCCC CGTCGCCTCC CACGGCGCCT TCGTACTCCA TCCCGGCACC GAGATACCGG 
8 85 01 AATCTGGCGG TTACCTCGTT GCGTACGTCC AGGAGCATGC CGCCGGTGGC ATCCGCTTCG 
8 85 61 TTACAGAGCG GGCAAGATCG TACAAAGATC GGCAACAGGC GAATAACGCG GCACCTGTTA 
88621 GGCGCCGGCG CCGACAGGTT CTCCTCAGTG GATACATCTT CGACGGTCTG ACTGTGGCGT 
886 81 ATAGCGCTGG CCAACCAGTT CTTACATTCC CTGACGCTCA ATCGTCTGTT ACCGGTCTGC 
88741 CGTATGCCTT TAAATTGACA ATAGTCCAGA TACATGTTTC TCTCATCGTC AGAGACGCCA 
88801 CCCGCGCCCT TTAAATGATT CCAAGGTAAA CGAACCAGAG TCTTTTCCTC ATCCAGCCAA 
8 8861 TGCATTCCAG GATGCTTATC ATTATCGCAA CAATCGAGCA ACCATGCCCT CAGGCCAGAC 
88921 GGCCGGTTTA CGCGGATCGA CCCTGCCCGT CCTTCCGCCA TCTCTGAGGG TCCGCTCGCA 
BS981 TAAACTGGCT TAAAAATCTA TCCCCCCCCG TGCACCTGTA CTTTACAGTC ACACCTTCGG 
89041 GCACACGCCG TAGGGCTGGC AGGGACGGCA CCCAGCAGCT ATTTTCATCT CTAGCGCCGC 
89101 CTACTAGCGG AAATCTGCGG TCTACAAAAT CCACCGAGCC GCCCGAATGT CACCATGAGT 
S9161 CAATTCATCC TGACAGATCA CCAAAGTGAC AGGGACCTGG GGTCTTTTAC TGGAGCCAGG 
89221 GCGACCCCCA AGGTAAAGAG TGACTGCGTT GCTGGGAACA GTGTCGCCGG GAACGGGTGA 
89281 CCGGGCGAGT CTGCGCAGTA GACCCATAAC ATAGAAGATT TGAGATTTGA CAGAGGATCG 
89341 CACAGCCCGT TGTGCGTGGT TCCCCTCTGC GTCGTTCCCT GCGCAGCGAA CCGTGTGCCT 
894 01 GGTTCTGTTA TCCACATACA TCCCCAGCTC ATTGCCATGC AAAAAGACGC CCTCCTCACA 
8 9461 CCCGTTTATA ATCTCACACA CGGTTTCTCT CTGCTCCGGC GTTAGCGGCC CATCCAACTG 
89521 AGGTTTTGGA AGCCACATCT TATACGCTGT CCCATAGAAA CACTCGTGAC CCGGTTTGGG 
89581 GTCTTGGCGC TTATGCAATC GTACCCCTCT ACGGTCCTGG GACGTGTGTT CCGCTACCAG 
89641 CGACCCAAAA TAAAAAACTC GTATACGTAG CCGCCAAAAT GGAGTGATCT GGTCTGGATG 
89701 CGGTGGTGCA ACCGTCGCCA ACCTGGCGGC CCTGGCATAT ACGGACCGCG CCGCACCGTC 
89761 AGCCTCCGCC GGAGCGGGAG GAGGCAGGGG CGTCCCTGGC TCAAGCCCAT GGACGCAGGC 
89821 CGCCTCTAAG TAATTTAGGA CGGTCTCCGA ATGACTATCT AAGATGCAGG CATAACACAG 
89881 CGGATCCACG GCGGGTCTCA AGTAAAAAAA TCTTTCTCCG CCAAAACGCG TTGTTGTTGC 
89941 CCGGGCATCG TCCTGTTCAA ACCCAGCGTT TTCCCGAAGC ACCCTTAGCA GTCTTTTCCG 
90001 GCACTCCGAC ACGTTGTGCT GGCGTCCACA TTCCAGATCC CTATCCAGGC AGTACGCAAC 
90061 CACAGGTTGC ATGCCACTCA TACCTCGGTG CCATGATATC CTAAAGCTCC TCTTTTCCTC 
90121 GTCGCACCAA CACAAGTGTT CGGGAAACGT GTTCGCTTCC AAATTGGAAA TAATCCATCT 
90181 TCGCAGGTGA TTGTGAGTGA TTTCTATCTC GGTCATTGTG AACTTGGTCC ACGAAAACTA 
90241 ACTCAAAAGC CCTCACCACT CCCCTAAACT GTAGCTGAAT TTTATGTCCA CCTTATAGGG 
90301 TGCACGTGTA CGTATATGCC CCGTGAAAAC CTCGCACTTC CTCATAAATT ATCCCCAAAG 
90361 CGTTTCGGGT CGCTCACCTA AACTAACTCG AGCATTACAC GGTTCATAGT GTCCCGGGTT 
90421 TCGCCACTCC CCCACCGCGA CAGAGAAACG AAACGTTCAA TTTAAAGGGC ACCAAGCTCA 
90481 ACTTTATTTA AGGAGAAAAA AACCACCGCC ATCAGAAGGT TTCCGCAGGC ACACACCCCC 
90541 AAGACTATTC TTGGCGAGTT CGCGGCACGC TGTAGTTTTT TTCTGATTAT TTGTAATTCG 
90601 CATCCAATCG CCCCAAACAT AGTCAATAAA AAAACCGTAA ACACACACAC TCTGGTGAAC 
90661 AAATTATGCG TTCCTGGCCC CAAGACGTGC GTGAGCGTCA GCAGCACAGC GGCTGCTAAA 
90721 ACATAAACCA GGGCTAAAAC GTTTGACCGA AACACGAGTC CGAGGCAACA AAAAAGCCCA 
90781 ACGTGGAAGA CAAAAAAATA GTGTACAAGT CCGAGGAGCA GAGGCGTCGT CAGGCCAACA 
90841 GACAGCGTAC CCGCGGCCGC CCCTCCAATC AAAACAGTCA CGAGATAAAA GTCCCATCCA 
90901 CACACCTCAA GCGCTTGCTT TATAGAGTAT GTGATGTACC GCCTAGAGGT GAGCGAAAAA 
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90961 ATTGCATGGC GTCGCTGCAG GCCCGCCTCA TAAAGCGGAT GTGACCGCAG CGCTCTGAGT 
91021 GAACAAACGC CACCTACTAT AAAGGCCACC AGGGGCAGCA AAAACACCGG GGCTAAAAAA 
910 81 AGGTTCTTAA GAACTAAAAA ATAATACACC GAGAACGCTG CTAAAAATCC AAGTTCGTAG 
91141 TATCTGTGGG CAATCGTGGG GCACAAATAT ACCACGTCAT TCGCCGCAAC AACACAGAGC 
912 01 AGCAGGCACA GGCACGACCC CACAAAAATG GAAACGTGAA CCCATGGAAG GTCAACGCCG 

912 61 AGGGTGGATG CCATAGAACC GGCGGTAATA AGAAACGCCG AGATGGTGCA CGGTACATCC 
91321 ACCTGCCGCG AGAGCCAGGC CGGTAGCCCA AATATACAGA ATAGCAAAAG CAGCCCATGT 

913 81 AAGCCAACCG CGCCCCAGGA GAATATAATG TTCTCAACGC TAATCAGGGG CTCTACAAAA 

914 41 TACAGCGAGT GATAAATAAA ACACAGAATA AACGGGGAAG CCGAGACCGT CCCGCTCATA 
91501 AAAGATAGCC AGGCGAGGGA CGCCTCCGAC GTGTACGTGC CCATGGCGCG GCGTTTAAAA 
91561 CAACGGGTTG AACGTCGCGC GAGGCGTCTT GGTTTTTCCT TTCCCGCGAC TGGGTGAGCT 
91621 GCTGTGTTTT CGCTTGCCGT GTTTTTTTTC CGGTCGAGAG CCGCTCGTCC GCAGTGATTT 
91681 GGCGCCCCCG TCTGTCGGCT CGGCACCGTC CTCGGCAATT GGGACGCACT GCTCTTTTAA 
91741 GTCCAGGTCA GGCGTTCTCG GAGGGGACGG GAGCTCAGGA ACCCCCTCGG GATCGGCACC 
91801 TTCCGAGAGA TTGCCCGGGA CGGCATTATT CCGGTCCGTC GGATCTACCT TTAATAGCAT 
918 61 GACGCCCAAC CTGACGTCAG CAAGGCTATC TGTCAGCGGC AGGCCCTCAA CTTCCAGAAT 
91921 CTCATTGGCG TGCCAGCGGA AACCAGGCAA AAAAAATCCC GGAATCTTAC ATGCGTTCAA 
91981 TGCGGCCACC AGAGACTCTA AACTCACATG AGTCGTCAGG TCGCTAGAAA CCACACCCAC 
92 041 GTCCAGCTTT CTTCCTCTAG CGCAAGTCCA CTTGGCGTCC GGGGTGTCGG GTTTAAAATC 
92101 AATAATTTTA CTAACCCCGT CGACCGTAAA CGTGGCAACT GCCAGCGTTT CGCTTAACGA 
92161 GACGGTGACC ACCTGCTTGG CCCTGGTTTT CTGCTGCCTT AGCCACTTTA ACAACAAAGT 
922 21 CAGCGTCTTA ACCGTAAGGA ACAGGCGATT CCCCGTCCCG GCCTGCTCTA GCCTGTCGTG 
922 81 GAACGAGGGT AGGTTGGGTT CGTCCACGAC GCTCTTGTGC ACGGTGCTCG TGTGATTGTC 
92341 CCCGTAGGCG ATGGTCGTCC TAACAAACTG CGGCTTTCCT TCCGCGTCCG ACTCGTCCCC 
924 01 GTGGTGGTAG AACGCCACGT TCATTTCCGC AATGTTAACT CCAAACATCT CACGGGTGTG 
924 61 TGTGAACGTG TTCCCAAGGG AGGTGTTTCG AAAGCTAAAG CTAACCTCGT CCGTCATGTC 
92521 AGAATATACC ATCAGCGGCG CCAGTGCATT TTTAAGCCTG AGACCGAGAA CGCCCGCGTC 
92 581 CCCCACGCTG GAGAGCACGC TAAGGACGGG GGCCCTGCCC TGGCCGCTGA TCTGTATAAC 
92 641 GCCTTTCTTA ACGATGCCCT TAATGTGGTC GTAAACCCTG CTAATGCTGC CAAGCGCATC 
92 7 01 GACGTCCACC CTGGCGCCGT AATGAAACGA GACAGGCATG GCTAAGGCGA AATTAACACC 
92 761 CTAAAAGACG AGAAGAGAAT GGCAAACACG ACAGACACAG CGTTCAATGT TCAACCTTTT 
92 821 AATAGAGCCT CTGGCCCGGA AAATGAGTAA CATGGGACAC ATGGGGACTA CAGATCATTA 
92 881 TGAACGGATG TGACATATTC AGAATTATCA CGCTCAAAAA AGTTCACACT CTTGGCACTC 

92 941 GTATACGACA AGGGACACGC GTTTGGGGGG CGGGCACCGA ACAGGGGCGG TATGTTTATT 

93 001 GACTTTAGAA TCCTGTCGGC CGTGGCCTGC AAAAACTGCT CGATGTCTCC GACGTTCACC 
93 061 ATGGTCACGT CTCGGCTCTT GGAACGAATA AACAGAGTCT CTATCTCCAC GGCTTCGCGA 
93121 AAGAGGCCGT GGATGTATGC GACGGAGGGG GATTCGTCTC GGGAGATCAT GGTGTTGTAC 
93181 AGCAAGCTGG CGGCCCTGGT GTGTAGCAGC TCGTCCCTGG AGATGTAGTC GTTAGCCAGG 
93241 CACACCCCGC GCATGATTCC GCGAAGGCGA AAAAGTCCTA TAGAGTAAAA TGAACTTATA 
933 01 AAGTATATCC CTTCGATAAC CAGGAACAGC AGCACCTTCT CGGCCCGCGT GGTGGCCTTG 
93361 CGCACTCGGG AATGCAGCCA GTCAATCTTC TTCGCCAGGG TAGCGTCCTC GACTATTTTT 
93421 TTGGCATAAA TCATTAAGTC CCCCCGATTA CCGCCAAAAA ATAAGTTCAG TATGTTGGCA 
93481 TACACTTTGC CGTGGATGTT TTCCATGGCT ACCTGTTCGG CGTAATAGTG GGTCACGTCG 
93541 TGATTGCTGA ACTCTTTAAC TAGGTCTTCG ATGTTAAAGT TTACCAGACT CTCTGCCATA 
93601 CCCAAAAATG TGAAGAGAAA CTGGTAAAAC TCCCTATCGG TGGGACTCAA TAGTCGCACC 
93661 TCATCGGAGT CCTCGTGCAG GGAAATTTGA CTGGGGAACC ATCTGTTCTG CGATGTCTCC 
93721 TGGGTGAGTG CTAAAAAGCC AGGGTGGTCG CTCGCGTACA GAAACCGATC TGGGGACTCC 
93781 ATGGCGGCGG TTACTATTGA CAGGCCAAAC ATACCTTGGT CCGCTCACCG TCGGCCTTTT 
93 841 GTACGCAGTT AGACTCCGCC CCCTCTGAAG AGATGTTGTC ACCCACGACC TGATCTAAGC 
93 901 ACTGCAGGGC ATTAAGTTTT GTCTGTTTCT GAATTCGACA ATAATACAGC ATGGTTTTTA 

93 961 ACCCTAGCCT GTATCCGGTC AGGAGCAAGT CTCGAATATA ACTCGCCCGC GTGGCCTGCT 

94 021 CCTCCTTTAA AAAGAAATTA AGCGACTGAC TCTGGTCTAC AAATGGAGCC CGCGCACCGG 
94 081 CCCGCTCCAA CTGCTTAATT GGGCAGTAAT CAAAGGCCGT CAAAAAAATC TTGTATCTGT 
94141 CCTTCAGGGG TTCTGGAAAT GAAGCCACGT CACCTCCGTA ACGGCGCACC GTTCGCAGGT 
942 01 CACCAGGCTT CACTCTCTTA AAAAAAGTCA CGTTTGGCTT CAAGATCTCT TCTTTGCTGG 
94261 TGACCTTCGA TGCGATATTG GCAAAGAACG GATAAAATGC TTCGGTGTAC CCCGTAAGCT 
94321 GAGAGGTTCC GGCCGTTGGC ATCAGCGCCA AAAACTGGCT GTTGAAAATT CCATGCTGGG 
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94 3 81 CAATGCTGCG CCCCAGCTGC TCCCATCTCT CCAAGGGAGG GTGGGACGGC TTAACGCCGT 

944 41 CCCACGTTTG CCAGTGAAAC ACACCCTGAG CCAATCGACT CCGCTCCCAT CCACGAAACG 

94 501 GAGTCCCCTC GCCGAGTAAA ACAATCTCAT GACTGGTTTG CACCGCCGTA AAATACATTG 

94 561 CCTGAAAAAT CTCCACGTCC AGCTTGGCGC TCTCGGCGTC AAGGTAGCCA AAGCCCAGTT 

94 621 CGGCAAACAC GTCGGCCAGT CCTTGGACGC CAATGCCCAT TGATCGTTCC TCCTGGCCGC 

94 681 GCCTAACGCT CTCGGTAGGC GCCGTCCCGC CCAAAATGCA CGCATTGACA ATGATAACTG 

94 741 CGGCCTCCAC GGCATCATCG AGCAGTTCAA AACCAAATGT TACATCCCCC TTCCCAGGCT 

94 801 CCACACCAGA CTCTCCCCTG TGGGGTCTGA GGCACTTTGG TAGGCTAATG TTTGCCAAGT 

94 861 TACACACGGA AGCCTGACCC TCGGGTTGCT GCACGATTTC CGCACAGAGA TTAGAACAGT 

94 921 TTATGGCGCT GCCTTGCGTC TCACACCAGT GGTGTTTGTT GAGCGCCTCC TTTAGCAGGA 

94 981 CGTAGGGACT GCCGGTCTTA ATGACAGTGT TAATAAGGGC ATACATCATC GATTTTAACG 
95041 GCAACGAGCT AGAGTGTTTG CCAGCGGCCA CTAGTCTGTT GTATTCAATC TCAAATTCGG 
95101 CACCGTAGAG TTTTAGGAGA TTTGGCGCCA CCTCTGGCGC AAACAGGTGC CACTGGCCAT 
95161 CTGGGTTTGT TTCGTACAGT CGGAAAAAAA GCTCCGGCAC ACACACGCCC TGAAACAGGT 
95221 TGTGACACCG CTCCTGATTC TCCGGCATCT TCGCGTTCAA AAAATCACAA ATCTGATGAT 
952 81 GCCATAGTTC CATGTAGGCG CTGGCACCAA CAGGACGGAT ATTGTTATCG TTGAAATACC 
95341 CAACGTGGGC GTTTATTAAT TTTAAACAGC TGGTGATGTT CTTGTGTTCC GCAAAAGACG 
954 01 AGACATCTAT CCCCACGCCT GACTTGCTAG CGAGAAGGGG GGACATTTCC TCATGAAGTG 
954 61 CTTTGAGGGT TTTGTCTTCG GTCGCCATGG ACGGCTTTAA AATAAAACAG CTAGAAAGCT 
95521 GACCTCCGCG AAGCCCGGCT GACCTTAATA CAGGCGTTGC GCAGCACACA ATCTGTGACG 
95581 AGATGTTUVTG GAACGCGTAA CCAACCAGGT ACATCTCATC CAGCTCCGTT TCGCTCTCCA 
95641 CCAGGTGTCT GAGGGTCTCC CGCAAACACG GAAATTTTAT ACACTGACAG GCCACAAAAA 
957 01 CAGCCACCCT CATAAACATC TGGGCCACGC TTTCAAAAAT GGGTGAAGAA CCCTGGGTTC 
95761 TCAGCACGTA CGTATCGTAA AACCTAACGG CAGACAGGTA GCCGCAGTTA ACGAAATTTG 
95821 TGTATGCCTT GCTCTGCTTA AAGTCCTGTA AAAGACCGTC AAGGGCCGCT TCGTGTTTTG 

95 8 81 ACATAAACGC GCGAACCTCG TCGCTTAGCC TTTCCCCGAA TACCGCGAGA TAGTCTCGCA 
95941 CCGTAACCAG GCACCTGTCC TCCATTATTC TGTGCCAAAG GAGACCGGTT AGTGAGTTGG 
96001 CCTCTATGTC CCATCCAAGT GTTGCCTTTA AAGAATTCAC GAGTGATTCG GCGCATCGTT 
96061 CAGGGTCAAG GCTTGAGCTA CACCCCCCAG TTCCCGCGTC GGTAACTAGG GTTAAAGGTT 
96121 TGGCAGATTT TGCCGCAGAG AAAGAAGTTT CGGTGTTCAT TGTTAAAGAA ATAGCTTCCA 
96181 AGACCCCGGA TTGGAAACAA ACTCCGTAAA TTTTAACACC GGTAAAGCAG CGCCTTTAAA 
96241 GTGAAGGCTT TGAAAAGATG GTTGTAAACC GGAAGGCACG CTTCCAAGTC TGCAAACTAC 
96301 GCCGAACGCA AGCCTATTTA TATACAGGTC ATTCTGCAGC TGAATGTATT TGGTGCGAAT 

963 61 CACGCCGCTG TAAAAATCCC TCAATTGGGC AGCTATTTCA CAATATCCTT TACCAGACTT 
96421 AAAAAACCCA AGTCCTAGCG CCGGTCTATT ATTTACCCTA ATGTAGGTTA AGCAAACATA 

964 81 AATAGACGAG TGGGCGGGCA AAGGCTCGTC GCGCAGGCTG GACGGGCATG AGTCATCTAA 
96541 ATCCACAAAC ATGTCACTAG GAAGCGTAAG GCCAATATGT GTAACAACGG GCTCTCTGGC 
96601 GACTACGTTG CCCTTTAACG CAGACGTCAC CTTGGTGACA AACGTACTGT GGACCGTTTG 
96661 AACCAACGGC CCGACCGGCG CAAGAAACTG ATGAAGCGAG CCGGTTTCCA ACAATTCTTC 
96721 AAAATTGGGT ATGGCGTCAA GTAGACCGCT CTCGTGGCCG TACCAAACAC ACGCTATTCT 
967 81 GTTGGTCTGG GGGGCAGAGT CCGCGTCCAT CCTAGACAGT CGCGCCAGCG ACGTAGGCGT 
96B41 GAATAACATG TCAATGGAGG ACCCAGTGTC AGTCTGTTTA AAGGAAAACA GGTAGGTGCC 
96901 CCGAGGTTCC TGTGAACTCA TGGTCTGAGA ATAAATCAAA AAATCTCCAT ACGTTTGACA 

96 961 TGTAGGCGAG ACAGATAAAA ATCCATCTTT GATGGCCTCC ACCCCAGTGG TGGTCGACAC 
97021 CACATATTTA GAGAGCAGAT CACGAACACC CTTAGAAAAG TCGCGACCGC GAGATAACGA 
970 81 AACCCGGTGA GGAGGCGGCG GCAGTAGACG CATCAAACTA TCATTCAGCT TGTTCACGTT 
97141 TGCGTCCCTG GTCTTCATGG CGTCGTCAAT TCCTGCCGCA CGGGCTGATA ACGGTGACGA 
972 01 GAACACCGGC GGGCTTTACA AATTAACTGA CAACCTCCTA ACGTGCACCG GATCGCTACA 
97261 ACAGCTTAAA CTCCTGATGG AGTTCCAACT AAAACCACTA CCAACCGCAC ACCTTTTAAG 
97321 CATGCCCACC GTGACCCGGT TTTTAAATAC TGCATTCAAA ATAGACAACC CCCTGGTTTC 
97381 ATTTATTCAA AAACACCCTG TGTTTTTTTT AATGAGAGTC GCCAGGCTTC CGGAGCCAGT 
97441 CATTACAGAC CACCAAAGCG CAGAAACGTC AACAGGCATA CTATCCGAGG TTGTGAATGT 
97501 TCTTAATACA GCTATTCGAA AACCACACGA GTCCCCAGCG GCTAAAGACA ACGATTATCT 
97561 CGACAACCGT GCCATATTGG CCATGATTAC AGAATACATC CATCACGTAA CTTCACGTAC 
97621 GCCCTCGGGG ATCCCAGCGA CACCACCAAT GGGTATCAGC CATCTACCGT GCGTAGAGCA 
97681 AATTTTACAC GAAACCCACC GGCAATACTG GAACCTAACC CTCCCGGAGT CGCTATTTAT 
97741 CGACATCGGC GAAGTCGCGT CTCCGCTTCA GACGTGGCTA ATCCTATCGT ATTGTAAAAA 
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97 8 01 ACTGCAACTG GCACCACCGC CCCTATTCCC 
97 861 TACCGGCCAC CACGAATTGT TTGTCCCCCT 
97 921 GCCAGTATCA AAACAGCGAG CGTTTGAGAT 

97 981 AGTTGACGGC ACACCTU^TTT TAGCGTTCAC 

98 041 ACTGTTATTC CTGTACGACT TCGTAATCGA 
98101 TTCGCGCAAC GCAATTGAGC ATTTTATTAA 
98161 GGCGTTCATT GAAAAAACGT GTGGCTATCG 
98221 CGTTAAGGCC AGGCTCGCGA GCTGTGGCCT 
98281 AATGATACTG ATGACACCGC ACGAGACGAC 
98341 AGAAATGGTA AACCAATTAA CACTATACGG 
984 01 CAGCCCCACT AGCATATCGC TGGCCAAAAT 
984 61 ACAGAGCGAT CGCGCCCTGT GGCGCACTCC 
98521 ATTGAACAAC GTGCTGGCGT TTTTTAAGCC 
98581 CTACAAGGCA ATACCGTCGT ACCTAATGAG 
98 641 GTGGGGTAAC ATCGCGCTCG CAGAGAGCGC 
987 01 AGACCAGGGT CCCGTGTCCG CCCAGGTTAT 
98761 CGCGACAGAT TATGACGCAG CCATCGTCTC 
98 821 AACAAAACTG TATCCGATCC TTAGCGAGGT 
98 881 GCTATTCCAA ATTAGATGGC TCATAGTCTT 
98 941 TATCAGACGC TCGCTGGCCC TGGCGTATTT 
99001 TTCTCCGGAG TCGTTTTACA ACCTGCTGGA 
99061 ACAGGTGATA CCGGAAGCCA CGTGCCCACA 
99121 AAACATTCCA ATAGCAGCGA GCTTCATTCA 
99181 AAACGGCATC CCCGGTATAC TGGACCTTGT 
99241 ACTACTGTAC GATTCAACGC TAGACACCGT 
99301 CGTCGTGAGC ATGTTCGTAT TCAAATCCAG 
99361 AGCCGTTATC GCGTTAACTC AAACGTCCGA 
99421 GCAACACCTC ACGTACATGC AAAAGATCGC 
99481 ATCCGTGTTT CACGAAATAC ACGAGGGATA 
99541 CATGTTACAC GTTACGGGAA GCTGCTGCTA 
99601 TCAACCCCCC CTAATACCCG ATACCATCGT 
99661 GGACACAACC GACGACATCT TAAAGAGCCT 
99721 GGCTGAAAAC GAAAGTAGCA GTCGACTCTC 
99781 GTACGACGAC TTTCGGACCG CCTCCACAAA 
99841 TTCGGGTAAT TATAACGAAA CACAAGTGTC 
99901 ATACACCGCA CCCGACGATA CGTTAAAATT 
9 9961 CATGTTTGCA GAATAAACGA TGGAATTGCC 

100 021 CGTGGCAACG ACGCACCAGG CCGACTGCAG 
1000 81 AAGCAACTGC GTTATCTACC TCGCACAAAG 
10 0141 CACTAACGAC CTCGACGACG TTTTACGGCA 
100201 GTCCGGGACG CTCGGCTATA ACCAATATGC 
100261 GACCAACGAG TGGACCGCGG CCATCTTTCA 
100321 TGACGCGGCC ATCCGCGAAC CTTTCATCGA 
100381 CGCCGGCACC GTTCAATACT TTCTGTTTAT 
100441 CAAAAACAAA ACGTTTTACC TGTTCGATCC 
100501 GGCACACGTA ATCAGTTCGT CAGACCCCAC 
100561 CAGAGAATAC ACTGGCAGTT TTCTATACAT 
100621 CTACATCACC AACCACTATA GAACTATAAC 
1006 81 TATATCCACC GGCATAGAGC CGTGCACCAT 
100741 GGATGTGACG TCAAAATCAT CCAACCTCGC 
1008 01 GAGCGCCAAA CCCCCACCGG CGACGCTATC 
100861 CTACCCAGAC CCGGCAACCA ACGACGCGGA 
100921 AACCGCCGTC GACCACCCCG AATTCCAAAC 
100981 ACTGTCGGCA TCCAGGGGTC GGAAACGCAA 
101041 CGACGAAGCG TCGTCTGACG ACGAAGGGGC 
101101 ATCCGCCGAG GTGATCTGGA TGGACGACGA 
101161 TCCATCGTTT GACGACGTGT TCGACAGCCC 



ACCTGTCGAC GAACTGGCCC GTCGTCTGGT 
ATCGACGTCA CTGGAAACGT ACATCACCAT 
ATACAGCGTC TTTGCCAAAT CAAAAAACAT 
CGACACAGAA CTTACAACGT TCACCCCCGA 
GTCCCTGTGC AAAAACCAGG CATACGGGTG 
GAAAGGTATC GATTTTATGG CGGAGTTGGG 
ATCAACCGTA AGCCTGTCCA ACGTGAGAGC 
ATCCAAAGAG GCGTGTGAAG ATTTCCGCGC 
GCCAAAGTGG GAAAATTTTA CGGACTTTTT 
GTTCTATTTT TACGAGTGTC TTAACCAGTA 
TCAAAATATC TTAAACCGAG TCGATGCCGA 
ATTAATCGGG TCTTTCCCGT TCCCCTGGAA 
TAGCACCCCC GTGGCAACGC TACAAAAAAT 
GTCTTTGTTC GAAATAGCGG CCAACAAATC 
CCCGCTAACC GACATACAGA CCGCCGAACC 
CGCAAAATAT TGCAGCCGTC TCCAAATAAG 
CAGCCCGGGA TTCGCCGCCG AGTTCATTAA 
GCTCCGAAAC ACGTCCAAAA AAAATCGATC 
CGCGGCAGAG GACGCCAGAG ACCTAGCCCC 
TCAAATAATG GACATTTTGG AAGAAAAGCA 
CTATCTTCAG GAAACATTTA GGTGCATACG 
AGAATTTCTA CAATACTTGT TTACTTTTCA 
AACATCCATG ACCTTTGTAG ACGACCTGAA 
CTCCTTAGGT GCCGCGTTTT ATAACATGAA 
CGAGATTCCA ACGGAAGAAG GGCAACCCAT 
GATTCGCGTC CTCGAGAAGC TCTTACAGGA 
ACCGATGTAC GCAGCGCACA TCCGTCTGAT 
CGGACACGAA ATAATGACCA CACAACTTCC 
TTTGCAGTGT TTTAAGCGTT TTAAACGTCT 
CTCACTGACG CGATACTTTG GATTCCTATA 
ACAAAAAATT TTAAACTTTA ACGACAAAAC 
GTCACAGCCC GTGAGACAAG GACCTCTATC 
AAAAAACAAC GTTGAGCTGC TTCAAAAACT 
CAATAACCCC ACCTCTATTA AACTTGAATA 
CGTAGATTGG AGCACATATA ACCTGGTGAC 
CACCCCGGTT AACACGGAGG CACTAGATCG 
GCCAATTTTT TCGAAATTTA AAATAGAGGG 
GTTCGGACAA TACGCCGGCT CGCAGTGCTT 
TTATTTCAAT AGAGAGTCCC CCGTGACGGA 
AGGGGCGACG TTGGACTTTA TACTGAGACG 
CCAACTGCAC CACATACCCA GCTTTATCAA 
ATCTCAGGAG TACTTTGGAC TAATTGGACT 
ATCCCTAAAA TCGATCCTAA CGCGAAATTA 
ATGCGGCGAC AAAGCCGGGG CCGTAATTAT 
CCATTGCGTA CCCCACGTAC CAAATAGTCC 
CGCCATACTA GAGTACGTGT CACCGCCAGA 
TATGCCCAGC GAATATGTCA ATCCAGAGCA 
GTTCGCCAAA GTACACGGCC CGCACATAGA 
CGAAGACATC CCAAGCCCGC CGCGATCGCC 
ACGCGTGCCG AGAACCACCA CCGACACGTC 
CGGTTTACGG GGCGCGGAAC CACCGACAAG 
CACAAAACTC TTAACTCCCG CTCCAGCGCA 
AACACCTGGA GCCACGCTAC TTCTTTCAGA 
GCTTTCCAGC CTTCAGCGAT ATTCGGATTC 
CCCACGTAGG CGCGTACACG ACGATGCGAT 
TATATCTCCT CTATATTCAC CGTCGGCGAC 
CCCAATGAGC CCGGAGTTTA CATACGAAGA 
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101221 CGCGACAGAG GACACGGATG GCGCGTTTCT 
1012 81 GTTCTCTGCG TTCGACGACC TTATAACGGA 
101341 AGAACAGTTA ATCAAGTACG AAGCGCCCTC 
1014 01 AAACGGGCGA GCCGTCCGAG AAGCGGCGGC 

1014 61 TATCATACTT GAACACGGTC TAATTACAGA 
101521 AAACGTTCTT CAATTTTTCA TCCTGTGGGG 

1015 81 CAAGCAGGTC CTAGAACTCG ATCTGCAACT 
101641 AAAATTCAAA CAGGGGGCGT TCAAAAAACA 

1017 01 GTCAATGAGG GCCACGCACG CAGATGCGCA 
101761 AGGCTCACAG ATTTCGTCCA GCGAAGCGAA 
101821 CGCCAACCAC CTGAGTCCAG GCTTTTTAGC 

1018 81 ACGGGATAAA ATTCAGGACC TAAAAACAGG 
101941 GGAAGAACTG TTTTTTGATG CCATGCTCAC 
1020 01 AACGGCATTT CCAATGGAGA TCTTTCCGCA 
102 061 GGCGTCCATA ACCACCAGGT TAACCGAGGA 
102121 GACCCCGCCC GACCAAGGAA CGCACATTAC 
102181 AAATATCATA TCCACGTTAA AAATCCTAAC 
102241 AACGCCCATG GCAACGTACA GGCGTCAGCT 
102 3 01 CTTTAATTTA GAGTGGCCAT ACGAAACCGT 

1023 61 GCGGGCGAAA GCAAAAATGG AATCGGTAAC 

1024 21 CCAAATACTG GGAGACGCCG AAACGTTACT 
1024 81 GAACCCGGTC CGCGCCATGT CCATACCGAT 
102 541 CCTGATAGGC AGTTCTCGAA ACCAGCGGTT 
102 6 01 GGCATCATCG GAGTCGTTCA TAATAATGCT 
102 6 61 AGACAATCTG GCCAAGATCG ACGGCATCCT 
102 721 CACTGTTAGC AT^CGCTCC AAACGCTGGG 
102 7 81 GCTAAATAAA AGGAGCCCCT CTTCCCTCAA 
102 841 ACTGGGGCAC GCGTCCGTTC CAGATCACGA 
102 901 CAGCGCACAA AAGGAATCCG CTGGCGATGA 

102 961 CCTAAACGAG CTGAAGCTCG TAACTACCCA 

103 021 ACTGTTAATG ATAATAACCC GTGACCTCAA 
103 081 GGAAACACGG TGGCAAGAAA ACGTGCTAAA 
103141 AGACTTTTTA CAGTCGGCAC CGTCAGCAAA 
103 2 01 GACGCTGATC ACCCAATTCA ACGGCCACGA 
103 2 61 CATGGACTAC ACGCCGACGC CCATACCCAC 
103321 AAAGGGAAAG GCCGCATGGA ATAAAATTCA 
1033 81 CATCGACGCT TCGGATTGGC AAGAGATGGC 
103441 TCCTGGTACG GTTGGACCAA AGCTGGTGCG 
103 5 01 CGACATCCTC ACGCAGAAGC TGGCATCTCT 
103 561 AGCGTTTGAC TGGATCGCGC CCTATCAAAC 
103 621 CCTGCCCATG GTGCGCAACC TGGCGGACAA 
103681 CGCGGTGCAA TCCGCAGACC TTCAACAGGC 
103 741 GGCCGAATAC TGTCGAATAC TCTCTGACAT 
103 8 01 CGTAAGATCG GAGGCCGCGG CGTACACGGA 
103 861 TCCCCCGAAA CCCAACCTAG AAGCCCCCAA 
103 921 CGTCGAGGAC TTTCCAGATT TCCTAAAAAC 

103 981 TGCGCTCCAG AGAGCGGAAT TTCAGCAACT 

104 041 CCGCCAATCC ACCCGTGACG AGATCGCAGG 
104101 ACCCCGCGCC CCCGTCGCAA TATCATCGAG 
104161 TTTGAGTTCA ACGGTATACG ACAAAATCCT 
104221 GGGATTCGCG TGGCTGGAAA TCGCGACAAA 
1042 81 AACGCAACAG TTAAACGTAC TGCTGAGCGA 
104 341 TCTACACGAT TTGGAACTGT CGGCGAAAAA 
1044 01 GCTAGACGAA CTCGCGCCCC TCAGGGTAAA 

1044 61 ACAAAAACTG GAAAGCATAG AATCCCTGCT 
104 521 ATCGGAGCTT GT^CGCATCG GCACACAGGC 

1045 81 AACGCTCTCC GATCAATGCC GGGAAGCCGC 



AGAACAGATC GCTCGAGACG CGGAAACGCC 
CCACGACTTT TCTTCCCTTG ATAAAAAAAT 
GCAGCACCTG CCAAACATCT CGGACAAACA 
CCTCCAGGCG ATGGACAAAA TTATGATTAA 
CGCGCAGGCC CGGGGACCGT CCGCGTGCAA 
AGAAAAACTC AACATACCAA TAAGCGACGC 
GATACCGTTA CATACGGCTA TCAGCGAAGG 
CCTAACAACT AAAATCAACC GGTGTCTGGC 
AAAAAAACTG GCGTCGGCTT TCAACGTCGA 
AATATCAGTC CGGGGGCTGA AGGAACAGAT 
GGTCTACTCC GCGGACGAGG TAAAACATCT 
CATCGAGCAG CGCAACAAAG AAATCCAACA 
AGCCCTGGAC ACGTTCCAAC CCCCTCCGAA 
CCGTAAAACC GAAGTTATGC TCGACCACCT 
CGCCACCGAA GCCCTCAACA ATTACCTGGA 
CAACATTCCA AACTTTTCAT CCATCGTGGC 
GTACGCAGAA AACGACATGC AATTAAACGT 
GTTGTACCTC GGAGGCGAGT TAGCAACCAT 
GCCACCGGTT CAAGAACTGC CCCTCGTGGC 
AAAAATGGAA AAGAACCAAC AGGCTCTCGA 
TGACACAATA ACCGCAACAT CCGGAGATGA 
ACTGGAGACC TACATTACAA ACGCAGGCGC 
CGAAAAACTC AAGGCCGCCA TCCACGACCT 
GCTAAACAAC ACGCGGCTCG ATAACATATC 
GACCAACAAC ACACGTTTTC TTTCAAACGC 
AGGCAGCCTA ATACGCGAAT GCGTAGAAGC 
CAACGCGCGT CTCCTCGCGG TTCAAACCAT 
GACGCTGACG CGAATCGTTT CCGGCGTCGC 
TCCAGATAGG TGGACGCGAG TAACCGGTCA 
ATCGCGTGTC GACAAAGCCA CCAGGCGCAA 
GGAGGCGGAG GTGTCTCAGG AAACGGTCCT 
GTTTCAACCG TCGACGTCCA AAGAAATCGA 
GGCCCGAAAA TTCGCAGAAA AACACCTACG 
GCGACCGCCG TCCGAGGCCA CCGCCGTTCC 
GCCACAGGCC GTTTCTACGG CTACCGCGGA 
ACAGGCCTTT CAGGATTTCA ACTTTCACCT 
ATCAGAATAC TCCAGACACG GCTCGTCCCT 
CTTCATGGAG AGCATCTCAA ACACCCTGGA 
GCTTCCAAAC GGGCCCGCGT TCAGACCCCC 
ACGCGTAAAC GCGTTTCTAA AAACCATAGG 
GATCCATCAC CAATGCCAAA CTGTCAGTCA 
CACGGTGGGA ACAAGTTTAG AACGACCCGC 
GCAAGTCGCG TTCAACGACC ACGGAATCGC 
CGCAATCAAC TCGCCGGCCA ACGTCGTGAC 
GAAGCTAATA ACGGCAACTG ACGCCCTAAC 
GTCAATCCTT CAACAGGAGC AGCGACTCAT 
AGAGGCCAGC ATCTCGGCGG CCGAACGGCT 
CAAGATGGCA ACCGCTATCA CGCAACTCTT 
ACCGTTGAAC TTATCAAAAC CTATAGACTT 
GGACAAGGAG CCTTACGAGA CAGCCATAGC 
ATCCGTAATG GTCTACAGTC AACAAAACGA 
GGTAGAAAAA CAGAGCACCG TCGCGCAGCG 
CACGGACGAC GTAAAGGTGC TGAAGCAGGC 
GGGCGGAAAA ACCACCGTAG ACGCGTGGAA 
TCGCGCCACG AGGACGGCAG GCGAAATATC 
GGTTGGCACC ATCACCGTCC GCGATTTAGG 
AAATTTCCTC AGACAGGCCA GTCTACCCGA 
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104641 AGGCTTCTCG GACATAGGCA CAAAACTCAG CGAGCTTCAG GCGTACATTA AGTACAAAAA 
104 7 01 ACAGTTTCTG GAGCATTTTG AAACAACCCA GCCTAACGTC TTTCAACGCT TCCCGCTATC 
104 761 CCAAAACATA ACCGAAAACG TTCCGGCGCG CCCGGCGATG GACTCGGTGG CCAGACTGAC 
104 821 CAATCACCTT CACGTGCGCG GCAGCGCGCC CCACTTTACA ACGTGGATAG AAACGCTACC 
104 881 GACCGTCGAT CCGGAAAAAC CAACTCACGT CCCGGCGCAC GGAGGAGCCC CTCTGCACCG 

104 941 CCAGATCACG TACTCAAACG TCCTAGAGGC GTTGTTTTCA TTATGTTCCA CCACGCTAAC 
105001 CCCGGTTCCG ACGGCCCCCG GTCTGGAAAT CGCAACCAGG GCACGCCGCG GGGCAGAGGC 
105061 cdt^CGTGG ATGGACAGAC AGTGGCCCGA CATCGCTCAG ACGCTCCAAG ACGTTCTCGA 
105121 CACGTACGAA CACACCACCG CCCACGCAAA CCGGGACGCG GCATTTAACA CATTCTTGGC 
105181 GATGTGCGTT TTTACGCAAA TCATCAGGGG CGCTAGCAGA GCCGTGACGC TCCCGAAGTT 
105241 ACCGAGCACC GCCGTCGATT TTCCAGAAGA GATCGTTCTA ACACCCAGGG AATGCACAAC 
105301 ACTGGTCACC GCCATGTGGC CCACCCTGGC GGCCGCAATC TTACGATTAA AATCCTACTC 
105361 GGAAGCCCTA GGACTAATGA GTCGTTTCCT CCCGCTAATG TTCCAGGCGC TGCCGCACCT 
105421 AACGCTAGAG GCCCAGGTCA AAAACGGCCC ACATAACACC CCGCCTCAGT TGAGATGCTT 
1054 81 TGCCAAAACA GAGGCAATTC CGTATTTCCC GGCGCAATGG CAGTCAGCGA ACCTAGAGCA 
105541 GAGCCTGTGG GGACAGACGG ACTTTTTGCA AATCTGCGAT AACAATCAAC GCAAGGCCAG 
105601 GGTGGCGGCC GTCACCTGGG CGCTCACGAC GATAGACGGC GTGGTTTTGG ACCAACTGTG 
105661 GTCCACATTT AAACCCATGA CAGCCGCGTC AGACGACACG TACGTCGACC TAGTCGAGAC 
105721 CCTACACCTG ACCACCTTTG GCCCGCGCGG TCCAACGCCG AGGCGAGAAA CGACCACCGA 
1057 81 GCACCCGCCG TACGAGTACG GACAGCCCAC GGGCTACTGC ATCTCGGGTC AATCGACGAC 

105 841 GCCGGTCCAG GCTTCAAACA CACCGGTATC CGCTTTCGAG GCGGTGCTCG GAGCAATGGT 
105901 GTTTCACGTA CCGATCAGAA TATTTTTGGC GGCCACGCCC AAGCGCCTTG GCCAGGCGCG 
105961 CGGCGGCATG GGGCTCCTCA CACCCATCCT GGAATGCGTC CCCGACGTCG AGCCCTTCAA 
106021 AAGCCTGTAT AACGCACCCC GCAAACCCGT GCCCATTGAA ACGCTACCCG CATCCCTCCA 
10 60 81 CCCGCACGAC GAGCGACAGG TCTTTCTGAG ACAGGCACAG TGGCTATCCT ACCGATTCAC 
106141 ACCACACGAA GCCGCCCGGT CGTCGACTCC GCCGCTTCTG GTGGTCATAG ACCCTGAAAA 
10 6201 CCTCGTAACG GCAACGTACT CCAGTGGCGG GCCTGCAAAT TTCGAGAGCA GGCCGTTTTA 
106261 CGTGATGCCC GGACCATACC CCCCAGACTG GCCAAAAACG CTGTCGGTAA CATCAAACAC 
106321 GTCCGTGACG CACCTCAGCC ACGACGAGAT ATGTAACCTC TTTACTACGC TATCCCGAGA 
10 6381 ACACGGGACC GTGCAAGGCA GGGATATCTT CGCAGCGGCT CCGACAAACG TCACACCGGA 
106441 ACAAACCGCC AATCCTCCGG CATGGGAAAC GGATAACCGA TTAATAACGC AAACAGAAAC 
106501 CGCCAAAAAA CCTCATATAA TTCCTGCGTC TCCTAAAGCG CGGACAGATC CACCGGTGGA 
106561 AACCACGACC CACCATTCAC AAGGGCAAGC GTCGCAACAC GCAAACAGCA ACGTAAACCA 
106621 GCCCGGTCAA ATTACTTCAC ACGCGTCACG TAACACACCG TCAACCGCAC CTCAGGCCTC 
10 6681 ATCTTCACCG GAAAAATTCA ACACGCAAAC GGTGCCTCGA CTAATATCTC AAACGTCGGA 
10 6741 AACGGCCCAT ATAAACCAGC CAGCCTCCGG CCAGGTCACC GAACCAAAGG GAATCTTTGG 

106 801 GACGTATAAA CCCCGAGTGC TCACCGAACC CGCCAAACCC GCAAACGCCG GCGTAGCCTC 
106 861 TCGCCAACCA GAGGCAACCA CCACGGTCCC CAAGTTACCG ATTAATCCAC CCACCGCTAG 

106 921 GGTCTTTATA GGGACCGCGT CCAAACTCTC GCCAGCCGTC GAAGAGAGCC ACGGCGCCAC 
10 6981 ACCCGACGCA CATCAGTCGA AGATAGATCG GGAAAAATAC GCCGAGAGTC GGCCTCGCCG 
107041 CACCCCACAC CTCGAAGAGG GGCCACGGGA GCCTCACGTC AACACTCCAA CCAGCGCACA 
10 7101 CATAAACGTC CCCTCTAGCC AAGGTCAAAA AACAGTACAC GGGCGCGAAA ATCCCGGCCT 
10 7161 TCAAACAGCA ACTCCCAGCG CCCCCCAACC AACCGCATCA AACCCGCGCA TTCAATACAC 
107221 GCTCCCCAGA ACGGACGGCC GGTTGCTTCA CGACGAATCG GAGGTGGAAT CGACCCCAAC 
107281 CGAGGAGGTA AAACGATCGC CAAAAACACA AGATGTGTCT CACGGGCCCG AACCGGACGA 
107341 CTCCAGGTGG ACCGCCCCGC TCGGTCCAAC CATAGAGATT CATCGACTGG AACACCCCCA 
107401 AATTCTCAAA AATATAACAT CACTCACCGT CCCCACTCCC AGAGTCACCC CAATCCCTCC 
10 7461 CACTAACATC TGGATACCCC TATCCCACGT CAACATCCAA CACGAAGAAA TCACACGAGC 
107521 CAAGAATGTG TTAATGCGAT TTATTCAAAA CGTACGAAGA AAACTTCAAG CGTCGTCTGA 
107581 CGCTCTATCC GAGGCTATTG CCAGAATAAA GTTTTTATAT CTGTAACGCG CCCATCTCAC 
107641 TTGCTTTTTT TATTTTGAGA CGAGCGTCTT GTGTCCAGAG TAGTTGCGTC GCTAGGTGAT 
107701 AACGAAGTGG ACCCTATGCC AGAAGACGCC ACGCTCCCGG GTCCGCCGCC ACCGGGCGCG 
107761 GGGCCCATCG AGCCTATCAA TGAATGGGGC CCGCTGGAGA TCGTAGTAAA GCTATTTGAC 

107 821 CCGAGGGTGG AGGCCACCGG GGCAACGCTC GGGGCGGCCG AGCCGGACCC GGACAAAACG 
107 881 CCCGAGATAC TAGAACTAGC GTCCTTTTTG CCGCGAAGGC CCCGGAGGTG GTCTTTGCGC 
107 941 AGAATTCCAT TCTTCTTTTG CATATACATG TCGTAATGAT GTTTGGCCGT TAAAAACACC 
108001 AGATAATTAC GTTTCGCGAT GGCATACTGG GCGGGAGACA TGTCACCCTG GGGAAGGTTG 
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10 8 061 TTCATCTCGG CAACCAGCGG GTGATTTGGG 
10 8121 GGCTCCTTAA CCCGCAAGGA CGACATCTCT 
108181 GGGACGATAT AGACGAGGTC AAATAAAACA 
10 8241 CGGTCGCGCA GTCCGGCGAC GCCAAACAGG 
10 83 01 CTCGTATGGC ATGTGCCGCG TTGTTGGCGA 
10 8 3 61 ACGTGATCTC GAATAGAGAT ATAACCCGCA 
10 8421 CGCGCCCGGT GGTAGCACAG ATCGAAAGCT 
10 8481 CGTTGGTGGG TGGGAAATTT ACGGTTTCAA 
10 8541 CGCACTCGAG GCACACCACC ACCCTCGGAG 
10 8 601 CCACCACCTG GGACCAAACA GGTAGAGAGA 
10 8661 CTTTTTGACC CAGGTCGCGG GATATCTCCG 
10 8721 CACGGGACAT GCTCAGCAGG GCCCTAAACG 
10 87 81 TGCCGCACGG GACGGTCGGC GCCAGTTCTC 
108 841 GGCCCGGGAC ACTCTCATTC AGGTTAGCTC 
10 8 901 ACTGCATCGA GACACAACCG CCCCGGTTCC 
108 961 CGTGCTGTGG CAAACAGGTA ACCAGGGGAA 
109021 CGAGACAACA CGCCCTCTGA AACGACACAA 
109081 CGGCCGTTAA ACTTAAACCT TCGTCGCTGG 
109141 GCACCCACGC GTACGCCTCT AACAACGACC 
109201 CGCCGTTTTT AGGTGGGTTA TCCGGCCTCC 
109261 GAAAGGATGG GGCAGAAATT AACGTGCAAG 
109321 CGAGCCCGTA CGTCACAAAA CACACGTAGC 
10 9381 AGGCCGCGAG ATCTAAGCCC AGCCGAGAAC 
10 9441 CGCGCCCATG ACAGGCAGCT TCGTCTTCAG 
10 9501 CCCAATTGTA CCGATTGAAT TGGTTAATTG 
109561 TCCTTCAACT CCGACCCCGA AGGGCGGGCC 
10 9621 CCGTCGGAGT GACCCGTCAG GGCAAAAACC 
10 9681 CTCTTTGGAG TCACGACCAA AAACTGTAAA 
109741 CATATGACCA TGGAGCTGTA AACGTTTAGG 
10 9 801 TTCATAAAAT CGTCCTGGCC CAGGGACACG 
109861 TTTATAGGCA GGTTTTTTCC AAACTGGGCA 
109921 TTCAGCGTGG TGGCCAAACC GTTGTTGATA 
10 9981 AGTCTCTCTA GTTTGTGGAG CCAAAACTTA 
110041 TGCTGCAGGC GCACGGTGCA CACGGGGACC 
110101 TCCTGACACC TCTCCAAGTC CAGGGAGATT 
110161 TCGCACAGCT CGTCAACTAA ACGTTTTCCG 
110221 CTGTGCGTGG CTCCCGTAAA AACCGCGTCC 
110281 ACGGCGTTTA CCTTGGCGTC CAGGACCATG 
110341 GCACGATGAC AGAGGAAAGT TTTTAAATAC 
110401 TATATGGTGG GAAACATAAT TTTCATGTCA 
110461 TCGTCACCGA ACATCACGTG AGACAGAGGT 
110521 ACGCGACCAG CCACCCCTTC AAATATTTTA 
1105 81 CAACTCAGCT TAAAAACTCC TACCCATTAT 
110641 GAGACACTGG CCCTCTCTGC GGGGACTCGT 
110701 CGGAGCTTTT AACAGCCCGG TATTAATCCA 
110761 GTGCAGGGTG TGTAGCCTCC TGTTCACTCT 
110821 CTTCGAAGAC TACGGCTGGT TGTGCCTCAC 
1108 81 TACCCTCATG GTGGCTGCCG ACCTTTTGGA 
110941 GAAAGATGGG CCAGTATACA CCGCCCAAAG 
111001 CTTCGCAACC CGCTGCTTCC GACCCATCGA 
111061 AAATTTTTTA CAAACCGAGT TTATTAGGGG 
111121 CTGTTTTAAA ACGTCCTGGC CGCGCACAGA 
111181 TTGTTCCGTT GGCCGCGGAA GTCACACCAA 
111241 AGAGGCGTTC AACTCCACGA ACGCCGAGGA 

1113 01 AACGTGGGCA GAATCCCAGC TTCTTGGCTC 
111361 ACCCTCCGCC TTCCCAACCC CAGAAGATGC 

1114 21 CCCAACGCTC AACCTAAAAA CAAAAAACCA 



TAATCGTGCT CAAGGCGTCC CTGAACGATT 
TTTAACTTCT ATTACACTTT CCACAGGACA 
GCTCGGCCAC ACGCAAATGC TTTAATAGGC 
GCACGACGAC GCTAACCAGG GAGTTCGCGT 
GCACGCACCG TAAATAGGGA TCTCCAACAC 
TGTGCGATCC GCCACAATAA GAGCAATATA 
GCTTCTCTTT TTGGTCGCGA CTGAAAAACA 
ATTTACCCCG TCCGAAATTC AAACAGTAAC 
CTGGCACGGT CTTCTCCAGT ACGCTCCTGG 
TACACGGAAA CAGTACGTTA TACGCCAATA 
TCTCGGTCGA CTCCCCTATG GGCAACACAA 
TCAGGCTCCT CAGAAGGGCG TTAAACGGGT 
GCAGCGAGGC CAGAAGTCCC GCGTCCGAAG 
CCAGACGTCT GGAAATGGAC GGAACGTTCA 
ATTTCTTCCG CAAAACGGGC AGATCCAACG 
ACCGCTGGCG ACAGTTAAGG GTTTTGCACA 
CAAGATACCT GGACCACGCG CTCCTGGGAA 
GACAGCCCAC GCCGGTGCAG ATACACCTCA 
GGCCGATATC GTGCAGCCGC GATCTCACGT 
AGCCGGTAGC AATCTCGTTC AGGGCGGTCT 
CCCATTTTGG GGGTCGTCCG TCCCAGGCAC 
ATTCGGGGCA TAGTCCGATT GAGCGTATAG 
CATCCAGCCA ACGATGGGCA GGATAAGAAC 
CCAGGTTAGG CAAGCGGCCC GAGGCCATCC 
GTCATCGCCG CACGCTCTCC GCGCCCAATA 
ACGGAGCCCG TTCGCCTCAA CGCACCGAGG 
GTTCTAAGAA GGGTTTTTAA CCGTTTAGCG 
ACCTGTCGGT GCTCCGTAAA GTAGGTGCGG 
TCTCCGGAGA AAACCAGACG TGCCTTAAAT 
GACGAGTTCC TCTCAAGATA CACGTCCGAA 
TCGGCGTCAC GTGGCTTACA CAAAAAACAT 
ATTACAAAAC ACGGGGCAAA CGGGTAGGCC 
TACACAAACT CGAGATGATA GACGCAGCCG 
GCCCCGCCTT TAGCGTATAC GGGAGCCCCG 
CCAGAGGGTC CCAGGTAAGA GACAACTAAA 
GAACTCATCG TTATAAAGAT CCTTTAGGTG 
GTGCTAACGA TTTTGTGAAT GACCTGTTTT 
CAGTGCTCAC AGTGAGCTGA CCGCGTCTGA 
TGACAGTAGT TAATGGCGTT GAGCCTGGAA 
TCGGGCAGCA GGGACTCGAA CGCCAATAAA 
AAAAGATGCT CACCGCCGGT ACCGCGTAAC 
GCCTTAAAAA GCGGGCCCCC TAAAGTCGTC 
GTTCGTCCCG TGGCAACTGG AGACCCTCAT 
AGAACAATCC TTCCTCCCCG GTACCCCGGA 
CACTCAGGAC TCTCTACAAC CCGCCTCATC 
GGTCCGGACA TTCCCACCCC CCGACTCTTT 
CTGCCTATAC GCCCCCCGAT CATGGACGGC 
ACTAACGCAC GTGTACTTCC CGCAATGCGT 
CATCCTCGGA ATCGACGTCC AGCTGCACTT 
CAGAGAACAA ATACTCCACA CATCTCATTT 
CATGTTAGAA GGCACGATTC CGGGATCGTT 
AAAGGACGAC CAACAACCTA CCGTTGCGTG 
CCGGGATAAC CGCCTACCCG AGGACCTGGA 
AAAGCCCAGC CTCCTCGGCG TCTTTTCGGC 
CGACACACAA CAGGCAGATA CCCATTTACA 
TGACCAATCA CAGGGCCCCT GCCTGATGCA 
CACCGCATCC ATATGCGTTC TATGCGAGTG 
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1114 81 TCTGGCCGCA CACCCGGACG CCGGTCCGGT TCTGAAAGAT CTGCGTCGCG ACATTCTGGA 
111541 AAACATGGAA AACAACGTTA AGCTCGTCAA TCGCATATCG TACATCCTAA ACGATCCGGA 
111601 CTCACTGTCA CACGTGCGCG ACGAACATCT GCGCGGCCTA ATTAAACGGT GCTCGGCACA 

1116 61 AGAAATCCAC AAGCATTTTT TTTGCGACCC GGTGTGCGTC CTGAACACGT ACTCGCACTG 
111721 TCCCGCGGTT TTATTT/^AAT GCCCACCTCC CGAAAAGTAT AAGAAGCTCA AAGCTCGTCT 

1117 81 GGCAACCGGA GAGTTCCTAG ACTGCAACAG AATATTTGAC TGCGAGACCT TACAGACCCT 
111841 GGCCGTCCTC TTTAAGGGGT CTCAACTGGC CAAAATCGGC AAAACCACGT CGCTCGAGAT 
111901 AATCCGTGAA CTCGGATTTC AACTGCGTCG ACACAACATT CAAATCACCC ACCCGTTTCA 
111961 AACCTCCAAC CTATACATTT AATCTTCAGA AGCGCACCAG ACAATGCCAA AACAGCCCAG 
112021 AAGTCGATTG GCGTCTCGAG CGCCGTACGC ACCTAGCGTC AGGCGACCGG ACGGGCCCCA 
112 081 GTCCACGCGA CCGGCATCCA GGCACGGCAG CTGCAAAAGC GAAATCATGC AGTGGAAAAA 
112141 GTTAGTTTCA GACACGCAGT TTTTTTCTGC CCTAACGCGC CGCCACGAGC TGGGGGTGGA 

1122 01 CTTTTTAAGA GAAATGGGGA CCCCGATATG CACCTCAAAG TCCGTTATGT TGCCGTTAAA 
112261 CCTAAAAACC ATCGCCCCGG GTCGGTGCGT CTCTCTCTCA TCATTCGGAC ACTCGTCAAA 
112321 CATGGGGTTC AACTGTTCGT CGTGCACGCC AACTGACAGG TCAGCGGTGT CTCTGGACGC 

1123 81 AAACGCGCTC GGCGAAGATT CCGCCAGGAA AAACAGCGAG CTGTGTTCAG TGGCGTTAAC 
112441 CTTTTACCAC CACGCCGAAA AGGTCGTGCA GCACAAGGGC TTTTACCTGT CTCTGCTCAG 
112501 CCACTCCATG GAAGTCGTTA GGAAAAGCTT CACGCAACCC GGGTTGCTCT ACGCCCACCT 
112561 AGTCCTAAAA ACCTTTGGCC ACGATCCTTT ACCTATTTTT ACAGTCGATG CCGATGAGAG 
112621 ACTCGCACTG TGGGCGGTGT TCCACACTAG AGACCTACAC CTGGGGGAAA CCAGTCTGCG 
1126 81 ACTCATTATG GACAACCTTC CAAATTATGA CATAACGGTG GACTGCATCA AGCAAACGTA 
112741 CATAATGAAG TTTACACCCT CGCGACCGGA CAACGCAACC GTGACGGTTC CTGTCAACAG 
112 801 CATTTGCGAG GCCGTGGCCA CCCTAGACTG CACCGACGAG TTTCGAGAAG AAATTCAAAG 
112 861 GGGCACGGCC ATCATAAACT CCCAGGGGCT ATTGTAACCT TTCCCTAGAC GGAAACAGAT 
112921 GTAATTCACC TATTCAAATG TTAAGTTTTT GTTTTGAACA TATCACCAAT TUU^AACAACT 

112 981 TATGTTTACA CAAATTGAAG AGCGTTTCAA TTTTACGATA AACATAGCAA AAACCACGGT 

113 041 AACTAAAACT CAATAGCGAT ATTCGGATGG GATCCCTAGA GAGCAATCGA CGTGCGCCGG 
113101 TATTAATTGA CGGGAACGCA CGTTAACTGC TCTCCACGAA TCGCAAACTC CGCGTTTTTA 
113161 GGACCATTCG ACGCCGTTAA TAATTGGCAG TAAACTGCGA AAGGGCGTAC ATCGCAGTAG 
113221 TGATTTTACG ATGTGCACAC CTTTCCGCGG TTCACCGCCA ACAACAAAAG CGGGTCACCA 
1132 81 GGCACCTAAA TCGCCCGGGC CTTTTATGCC AAAGTACAAA AGGGACCGGC GTTCTACGTT 
113341 CACGAACAAC ACTGTTGATC GCAAATACAA TAGTGTTGTT GGTGACCGCA AACTGCCAAA 
1134 01 TCCTAAAATG CATGGACATA GTCCTCAAAA TCTAGAACGA ACTTCCAAAA ACATGCGGCA 
1134 61 ATAAACAACC CCTTGCAGTT AATGCACATC GTAACGAGAC GGAATCCAAA GAACACGCCC 
113521 GATCTTAAGA CCACGGGCAC GTGTCTTTGG GTTCCGGGGG CGTTACGATG GAAGTTCATC 
113 581 ACCAAGTATC CCATCAACCT GTTACAAT^G CGTAAAGATA ATCGGTTTAC AATAAACAAT 
113 641 AAAAAATAGT GGACGAAGTT ACATTCCGAC ATACAAGCGG ACCCAAAAAA ACACCCCAAC 
113 7 01 GCAAAACAAC TGGAGACTGT GTCTTTGGGT CCCTTCCACG TCGCAATGAG ATTTCTGGTG 
113761 ACCGTGTGCT GCTTAACGCG TTACAGTTGT GTTTTATGGC TACGTAAACA CACCAATCTA 
113 821 GAATGCTAAC ACGTGCACCG CGCTTAGGCG CGTTTGTGCA ACTAACTGCT GTCAGGTTAC 
113 8 81 TACTCTTATT TTTAACAAGT AGCATACCAA CTGCAATTAA TTGCACTAAA CCCAGTTTAG 

113 941 TCTTTTTAGT CGAGTCGGCC CATAGTCTTC ATGGAGAACC ATCATTCATC TCCGTCAACG 

114 001 GCGTCGGTGT TACCGTTGTT ATTGCGGGCA AAACAGGCTC GGGGGAAAAA AGAGCAACAT 
114 061 ATCGTCAGTT TAAAACAACT CAAGTTCTAA AATCTCACCC ACTCGTAAAC AAAAAACATC 
114121 GCAGTTAAGG TGTATGTACC GAACATAACA ACACAAGTTT TTTAAAACAC AGCTGCGGTA 
114181 AGTAACCCCC ATTGCCACGC GTGTCGCGTG CTAAGTGTTT TTAAAATTAC ATTGTGCGTT 
114241 TTACACACCG AGCAGTAATC TCAGGAGGGC GGTTAACGAG CGATATACAT ATTCCCTAAA 
1143 01 CACGGGAACG CGCGCTGACC GCCCTCCCCA AATCACAACA CGGGACTACA AAGCCTAGTG 

1143 61 TTAATATAAT CAAATTAAAA AACCACAGAA ACCTTTAGTC GTGCGCAAAC ACTAGCAAAG 

1144 21 GTACCTAGAG CTTTCCCCTA TACTTCAAAA AACAGCGGTG GGTTATTTGA CCACACGTTA 
1144 81 AGTAAACACC CGTAAGAATT ATTCCCGTTT ATCAAAATGG AAAAATTAAG GCTTTGCGTT 
114 541 AAAATCTGCT AACGCAAAGG GCACTTAATT TTTCCAGTTT GGACTCGGAA CTTTACCGCT 
114 6 01 AACGTTAAAA TTTAATAATG CAAAAGGCAC ATAATTTTTT TTGTGTAACC TCAAAACTCT 
114661 ACAGCTAAAG TTAATAACGG CGACTTTGGG CAGCGAATCA GTGTCTGTCG CAAACCCTGT 
114 721 TAAATTTAAA CACAACAGGG AGCGCGACTA AACACGGACT AACTGCTCAT CGGGCCCTGT 
114 7 81 TGAAGAAGTT GGTGTTGCAA TGCATTAGGG AATCTAAAAA CGAAATCCTG CTCGGTAACA 
114 841 CGGAAATTGT CTTTAGTTCC CTACAGCATC ACAACAATTG TAAACCATAA ACGTACGCGC 
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114901 
114961 
115021 
115081 
115141 
115201 
115261 
115321 
115381 
115441 
115501 
115561 
115621 
115681 
115741 
115801 
115861 
115921 
115981 
116041 
116101 
116161 
116221 
116281 
116341 
11640.1 
116461 
116521 
116581 
116641 
116701 
116761 
116821 
116881 
116941 
117001 
117061 
117121 
117181 
117241 
117301 
117361 
117421 
117481 
117541 
117601 
117661 
117721 
117781 
117841 
117901 
117961 
118021 
118081 
118141 
118201 
118261 



GTGGGGTTGG 
GGTGCTTGGT 
GGGTGCAGGT 
TGCAAGTCCC 
AAGTCCCCCG 
TCCCCCGGTG 
CCCGGTGGGG 
GGTGGGGTGC 
GGGAGCGGCT 
GGGTGGGGGC 
CTCCGGGTGG 
GTGGCTCCGG 
CGGGGTGGCT 
GCTCCGGGGT 
CTCGGCTCCG 
GCGGCTCGGC 
GGGGGCGGCT 
GGGTGGGGGC 
CTCCGGGTGG 
GTGGCTCCGG 
CGGGGTGGCT 
GCTCCGGGGT 
CTCGGCTCCG 
GCGGCTCGGC 
GGGGGCGGCT 
GGGTGGGGGC 
CCGCAAGCTA 
GCGCTAAATG 
AATGAATCAC 
GCCGTATATT 
GCATTGCGCA 
CCCCAAACAT 
AACCCGAGAC 
GTGCCTGCGT 
AGGCTAAGCT 
ACGGGATGTA 
TACCTGAATT 
CTCATACCAT 
CTGGAGTGAC 
GGTACCAGGC 
TCGGTAATGG 
GCTGTAGGTT 
GTACCTGAAT 
ACTAGGCTGG 
ATATATATAT 
CAACCAGCAG 
TTGATTGACA 
ACGGCCGCGG 
GGGAAACGTC 
ATCGCTGGCG 
CGACCCCGCC 
CTGACCGCCG 
AAACATTTTT 
CGCGGCCCCC 
TGATATGTTT 
TTCCCCCATG 
CATGGGCACC 



TGTGGTGTGA 
GCAACGGTGG 
CCCCCGGTGG 
CCGGTGGGGT 
GTGGGGTGCA 
GGGTGCACGT 
TGCAAGTCCC 
AAGTCCCCCG 
CGGCTCCGGG 
GGCTCGGCTC 
GGGCGGCTCG 
GTGGGGGCGG 
CCGGGTGGGG 
GGCTCCGGGT 
GGGTGGCTCC 
TCCGGGGTGG 
CGGCTCCGGG 
GGCTCGGCTC 
GGGCGGCTCG 
GTGGGGGCGG 
CCGGGTGGGG 
GGCTCCGGGT 
GGGTGGCTCC 
TCCGGGGTGG 
CGGCTCCGGG 
GGCCTAAAAT 
GCCGCACAGA 
CCCTCCCCAA 
CAAGAATACG 
CGAATTGCCT 
ATATTCCCAG 
CATCCAGCGC 
CCACTGAGAT 
GTGTGGTAAG 
GTAATTAATG 
ATTAATTCAA 
AATGCGGGAT 
GGGATTAAAT 
TCATACCATG 
TGGAGTGACT 
GTACCAGGCT 
CGGTAATTTA 
TAATATAGGA 
AGTGACTCAT 
AGGGAACTTG 
CGAGAATTCC 
GAACGGCCGG 
CGACCAATGG 
CCTGCCGGCG 
CGGTGCCACC 
GCCCTCATGC 
GACGACCCCG 
CAATAACCCC 
ACCGTCCCCA 
CAGTGACCCT 
AGTACATGTT 
GGACGGCACC 



AATGCTTGGT 
TGGGGTGCAA 
GGTGCAGGTC 
GCAAGTCCCC 
AGTCCCCCGG 
CCCCCGGTGG 
CCGGTGGGGT 
GTGGGGTGCA 
GTGGCTCCGG 
CGGGGTGGCT 
GCTCCGGGGT 
CTCGGCTCCG 
GCGGCTCGGC 
GGGGGCGGCT 
GGGTGGGGGC 
CTCCGGGTGG 
GTGGCTCCGG 
CGGGGTGGCT 
GCTCCGGGGT 
CTCGGCTCCG 
GCGGCTCGGC 
GGGGGCGGCT 
GGGTGGGGGC 
CTCCGGGTGG 
GTGGCTCCGG 
CCTTACCGGT 
GGTGCGACTG 
ATGGTTACTA 
GCCAACGCGC 
TCCGCACGAA 
CATTCCAACG 
ATAGAGGTTG 
GCTATTCAGT 
GGAGTTCCCC 
AGGCACAGGC 
ACCTATATAA 
ACAGGCTGTA 
ACCAGGGATA 
GGATTAAATA 
CATACCATGG 
GGAGTGACTC 
AACCTATATA 
ATACAGGCTG 
ACCATGGAAT 
TAAACAAAAC 
CTGATTGACA 
CCAATCGCAA 
CCGCCGCGGG 
GCCCCTAATC 
GCCGCCGAAC 
GCACCATGGG 
GCGCCAAAGA 
AGTCTGACCC 
TTTCCCACGG 
CGGATGACCC 
TCGGTGACCC 
ATGTGCACAT 



GCAACTGTTA 
GTCCCCCGGT 
CCCCGGTGGG 
CGGTGGGGTG 
TGGGGTGCAA 
GGTGCAAGTC 
GCAAGTCCCC 
AGTCCCCCGG 
GTGGGGGCGG 
CCGGGTGGGG 
GGCTCCGGGT 
GGGTGGCTCC 
TCCGGGGTGG 
CGGCTCCGGG 
GGCTCGGCTC 
GGGCGGCTCG 
GTGGGGGCGG 
CCGGGTGGGG 
GGCTCCGGGT 
GGGTGGCTCC 
TCCGGGGTGG 
CGGCTCCGGG 
GGCTCGGCTC 
GGGCGGCTCG 
GTGGGGGCGG 
AAATTTAGCA 
CCTGCCAAGG 
TGGTTTAGTT 
CAGCGTCCAA 
TACGGGGCGC 
GGCACGCAAG 
TGGCTGAGCT 
GGGACTGCAG 
TAATTAAAAT 
TGCCGACCTA 
ATTCCACCCT 
GGTTTGGTAA 
CAAGCTGTAG 
CCAGGGATAC 
GATTAAATAC 
ATACCATGGG 
AATTTTACCC 
TAGAATCGGT 
TCAATTAACA 
CCAACTCGCG 
GGCGGACTGG 
CCGAGAACTC 
TTAGTTTGAT 
CCCCTGGCTT 
CCCCGCGCCC 
TGCGATGGGC 
GCGGCCGATG 
CGCGGCCCAA 
TTTGCTCATG 
GGGCGACGTT 
GCCGGCGACC 
CTTTCAGTGA 



CATGGGCGGA 
GGGGTGCAAG 
GTGCAAGTCC 
CAAGTCCCCC 
GTCCCCCGGT 
CCCCGGTGGG 
CGGTGGGGTT 
TGGGGTGCAG 
CTCGGCTCCG 
GCGGCTCGGC 
GGGGGCGGCT 
GGGTGGGGGC 
CTCCGGGTGG 
GTGGCTCCGG 
CGGGGTGGCT 
GCTCCGGGGT 
CTCGGCTCCG 
GCGGCTCGGC 
GGGGGCGGCT 
GGGTGGGGGC 
CTCCGGGTGG 
GTGGCTCCGG 
CGGGGTGGCT 
GCTCCGGGGT 
CTCGGCTCCG 
GTAAATCCAA 
CTCCTGGCGC 
ATGCAACCAT 
CCGCCCTCCA 
AGCATGGTCG 
GGTCCAGGCT 
GCCTATCTGG 
GCTGGGTCCT 
ATTATATTAA 
GGTACCTGGG 
GTTAGGGGGA 
TGGGTACCAG 
GTTCGGTAAC 
AGGCTGTAGG 
CAGGGATACA 
ATTAAATACC 
TGTTGGGGGA 
AATGGAAACC 
AGGAAAATAT 
GATTGGCTGC 
CCAATGGTTT 
GGCAGCGAAG 
TGACGGCTTG 
AAGGGATTAC 
GGTTCGGCGT 
ACCGGCGGCA 
GGTACGCGCG 
ATCTGGGGCC 
GGCACCGACG 
TTGCGCACCA 
CCGGGCGGCA 
CCCGCGGCCG 



TTGTAAATGT 
TCCCCCGGTG 
CCCGGTGGGG 
GGTGGGGTGC 
GGGGTGCAAG 
GTGCAAGTCC 
CAAGTCCCCC 
GTCCCCCGGT 
GGGTGGCTCC 
TCCGGGGTGG 
CGGCTCCGGG 
GGCTCGGCTC 
GGGCGGCTCG 
GTGGGGGCGG 
CCGGGTGGGG 
GGCTCCGGGT 
GGGTGGCTCC 
TCCGGGGTGG 
CGGCTCCGGG 
GGCTCGGCTC 
GGGCGGCTCG 
GTGGGGGCGG 
CCGGGTGGGG 
GGCTCCGGGT 
GGGTGGCTCC 
CGCAGTAAAT 
CTCTTTTATA 
AATACCAATA 
ACCGTGCAGC 
CGGGAGGCTG 
ACCGGGGATT 
GGATTCCCCA 
CCCAGGACAG 
TAAATAAACC 
GAACCCCCTA 
TCCCCACGTG 
GCTGGAGTGA 
GGGTACCAGG 
TTCGGTAATG 
GGCTGTAGGT 
AGGGATACAG 
ATCCCCGTTT 
AGGCTGTAGG 
AATAGAATAT 
CTGTTTGGGC 
GCGAGCATTT 
CAAAAGACAG 
GCCGGCCAAT 
GGCCGTAAAC 
GACCTCGCCG 
ACATTATTCG 
GCGACCATCG 
CATGGTGGCG 
GTGCCCGTGG 
TGGTCCCGAA 
AAAGCCGCCC 
ACCCCGGCGG 
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118321 
118381 
118441 
118501 
118561 
118621 
118681 
118741 
118801 
118861 
118921 
118981 
119041 
119101 
119161 
119221 
119281 
119341 
119401 
119461 
119521 
119581 
119641 
119701 
119761 
119821 
119881 
119941 
120001 
120061 
120121 
120181 
120241 
120301 
120361 
120421 
120481 
120541 
120601 
120661 
120721 
120781 
120841 
120901 
120961 
121021 
121081 
121141 
121201 
121261 
121321 
121381 
121441 
121501 
121561 
121621 
121681 



CGGCACCGCG 
TGTCAGCGGA 
TATTAAAGTT 
CTGCGGCGAC 
GAGGCCGGCC 
CGTTCCGTTG 
GCACTTTCCC 
TCGGTGAACG 
CCCGCAACGC 
GGCGCGGCTT 
GCTGTGTTTG 
CGCCGCCCTC 
CCATGTGCGC 
GTAAGAAATA 
CTTTCACCCG 
GCGTTGGCGG 
AAAACCATGT 
AAAAAGGTGG 
TTTTTCACGT 
CTAAAGTAAC 
AAGGTGGAGA 
GGAATACCGG 
TCGGACGCCG 
CGACCGAAGT 
CCCTCTTCTA 
AATGTGGTCT 
GGTTGCGGCC 
GAGGCTAAAC 
GTTCACGATG 
TTTTGTAAAG 
TAGTTTCCAC 
AAAATCCTCC 
CTTGCCGGCT 
GCGACACTCT 
CTGGTGGGCC 
CACCTCCGTT 
CAGGAGGTTA 
TGGGCCAACA 
GCTCACGGAG 
CCGCTCCACA 
GGACTAGATG 
GCCAACCAGT 
TGGCAGCCGG 
CTATGCTGGC 
TTTTGACGGC 
TGACGGGACT 
ACTGTAGGCA 
CGTGACACCA 
ATCTTCTCTG 
GAATGGGGTA 
CCTCCCCCTC 
TGCATCTGAG 
CGGTGTACGA 
GGTGGTCTGT 
GCGATTGGGG 
TTGGCGATTG 
GTTCAGGTGA 



GGCGCTGGGG 
ACATATTCTC 
ACGTTAGGGA 
GCTGACGCCG 
GGCCAAAGGT 
TCTTCTGCCT 
CGGTGAGAAT 
GGAGGCGCGC 
GAGGGGCCGC 
CCTCCCCCAA 
TGGCACGCGC 
CTCGCCTCGG 
ATCCCCCCAT 
ATTGCTTTTT 
TTGTTTTGTT 
CCACGACCTT 
AGTTTGAGGG 
GTGACTGGTT 
CCCTTTCTGA 
TGGGGCTGGA 
CCAGGTCCAA 
GTTCTCCAGT 
GCCGGTCAAA 
CCACTAACCG 
CCTACACTAG 
TCGTCAAATT 
GAAACGCCGA 
AGCGCGCAGG 
GCTTTGTGAA 
CCGCTTTTAA 
CTTAAAGCCT 
GCCGCGCAAA 
AAAAATAGAC 
AGATAGCGAT 
CTGGCAACAG 
TGTATCGTCC 
GAGAGGGCCC 
GAAGCCATTG 
GCTTAATACG 
CTGGTCCCAT 
CAACGCGGAA 
GACGCGTTTC 
CCGGTGGGGA 
CTGGAAGTGG 
CTTGGCGGTT 
GCCAGGTGCG 
CCTTGCCAGG 
CGGGCCCGGC 
TGGGCGGTAT 
TCGCGGCGGG 
TTGCTCCTGT 
ACGTAAACAA 
GTGGGATGTA 
GTGGCTCACC 
TGGTCCCGGT 
TGGCGGTCCA 
GTCGTGGTGC 



GTGATATGTT 
CCCGGCTCCC 
TTAATTTTAA 
GCGGGTTCGA 
GCGATCGCGG 
GCGGTCCCTG 
AGACAGGAGA 
CGCTTGCGTG 
TGGCGCGGAA 
CGACCCCTAA 
TTAAGCGTTG 
TGCACGCTGC 
CTTATTTCCC 
TTATTTTCGT 
ACCAGCCCCT 
GGCCACGTCG 
GCTGACCAGG 
GCGCCTGCGA 
GAGCTGGCGG 
CCGGAGCCTC 
TCGCCCAACG 
TGAGGGGCAC 
AAGCCACAGC 
CTTATGCGGG 
AGCGTTTGAT 
CCCGCAAGGT 
CGGCCTCCGC 
CGGCCGCCAC 
CAGACTCCAC 
ACAGGGTGGG 
TAAGGATTGA 
AGCAT7UVAAA 
AGGCGGCCCC 
CCAGAATACT 
ACCGCATCCA 
CATAGCAGGC 
GGTCCTGGTA 
TCAAGGCAAG 
AACGTCCGGC 
GTCTTCTGCA 
GCGGCTGTTC 
CTGCAGTCTG 
TTTTTTTAGT 
GGGGCGGATG 
CCGCGGCTTT 
TCCGGCAAAA 
CGATACAGTG 
CGAGGGGGGT 
TTTCTGGGTA 
CGCGGGGGGC 
TGTCTTTCAG 
GGAACTCCTC 
TGGCCAGGAA 
GCCTGTTGCG 
GAGTCGGGCG 
GGTGAGTCGG 
GATGGTTGCA 



CTGATGACCG 
GAGCCCGGGG 
TCCACTACCC 
CCCATTGACC 
GGTCTGACCC 
GCCCCCTGGC 
GTGCCCGCAA 
CGTGAGGTGT 
ATTTCCCCAG 
TTTGGTTTTT 
CCACCTGTTG 
CGAGCGTGCC 
CGACCAGGCA 
TAACCCCGCA 
CTTGGCGGCC 
CGCCTGGAGA 
GCCGCCTTTT 
CCAAGCTGGT 
TCAATGGAGA 
TCGATAATAA 
AGAAACACGG 
AGCCCGTTTG 
ACGGCCTCTC 
AACATGGGTC 
AAATCACGAC 
GGTGGCAACC 
CAATCTGGCC 
CAGGGACGGG 
CACCTGGCTG 
CCCGATGGCG 
TTTTTCCTGT 
GGCGGCCTTG 
CAGCCTCTGA 
AACGGCCAGT 
CGTCCCCAAG 
CGTGGATGTT 
CAGGACGGGG 
GTAAAGAAAG 
TTAACACACA 
GTTCAGCCCT 
ATTTAAGGTG 
GGTTTTCACC 
GCTGAATTGG 
GGTGATGTCG 
GACAGAACAC 
CGCGGACGCT 
AATAGGGGTC 
CGATCCTGGG 
CCTCTTCCTC 
GCGGGGGGCG 
GAGCTTCGTC 
CGGCACGCGT 
ACGGGGGTCC 
TGTTAGGGGA 
GGGATTGCGG 
GAGGGGGCAA 
AAAACGACGA 



CCGCCTTACG 
CCCGCGGACC 
CCCTTAATGT 
GCATCCTCGG 
ACCCGTGGGA 
CCACCGGGGA 
GACATCGACG 
CGCCGGTGGG 
AACCGGCCGC 
AAGTTGTTAT 
CGGAGATCCC 
ACCTGTTTGG 
ACATAAACGT 
ACCGCGATAC 
GGCACCGGCG 
CGGCCTGCAG 
CCATTTGCGA 
CGCCCGTTAG 
GCATCAGGTG 
ATCCCACGTC 
CCTCCAGCAA 
CAAACCCCTC 
TGTCGTCTGC 
CGGTGCGGCG 
GAACCGCGGG 
GACTCGGCGG 
GGAGGCGGCG 
GGCAGACCGC 
TGAAGGGCAA 
TCCGTTGGTA 
TTCAGTAAAT 
AAAGGATTCA 
AACCGCCGAC 
GGGAAAACGC 
ATAACTCGCA 
ACAAAGGACG 
TCAATGGTCC 
CGATATAAGT 
ATCTTCTTTG 
GCCTGTTCCC 
AGTCACAGGC 
TGGCAAGCGC 
CAGTCCTCTG 
TTCCCACCGG 
CGTAATGCAA 
GGATGCTCTA 
TGAGTTACAA 
TCCCAATACC 
TGATGAGGGA 
CGCCGGTTGC 
CTCGTGGGAG 
GGGTGGACCC 
TGGCCGAGAC 
TGGGGCCTGT 
AGGGGGCGAG 
GGACGTCGGA 
AGGCTGCTGA 



TTTGCAGCGG 
CGAGTTGGGT 
GCGGCTCGGC 
TGGCCGGGCA 
GAAACCCCTC 
TGTCTTGTGA 
GGGTTGCCGG 
CCGTATGACC 
GCCGCTTGGC 
TTAAAGCTCC 
CCGTGCGCAT 
TTGTGTCATC 
CACGTCGTTT 
AGTCTCTCGC 
TTAACCGGGT 
CAGGTCACTT 
GAGCCACCGA 
AAAAACTAAA 
TTTGTATGGA 
TAGTAAAAAA 
CACCGGGAGG 
GGGCGTATCG 
CTCTAGGTGG 
CTCGCCGACA 
CGTTACTTAA 
CGGCCGCGAG 
CAGCGGCACC 
CGGTGGCCGG 
AGAGCTGCTC 
AGACGGCCTC 
CCGCCACGGA 
GGTCTCTAAT 
GGGGGATGCT 
TAGCATCCGC 
TGCCCACCGT 
CTTCGTGGGC 
CGGTGGGCAC 
CCTACCTGCG 
CGCTCTATGT 
GGAAAGCGGC 
GGGGCAATTT 
CACTGGTTCT 
TCCATGCGCA 
CGACGGGCCT 
AAAGCCGGGA 
ATGCCGGGAA 
AAGATGACTC 
TGACGATAAA 
TACGGTATGG 
TGCTGCCGCT 
TTGTGACTCG 
CAACCCCCTA 
GGCCCGGTGG 
TGAGGCGTTG 
GACGTCGGCG 
GGCTCAGGCG 
TTGGGCGATG 
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121741 GATTATGCGA CGACGGCGGT GCAGGTGAGT GGGGGCCGGG GTGTGACGGG GGTGATGGTG 

1218 01 GGGGCGAATG CG ATGCGGGG GATGCCGGGG ACGACCCTGA CGTGTTTTGA GGTGACAGGC 

121861 CTGAAGCTAA CACCGGCGCA GACCCGCTTG GCGACGGCGT TGGAGATCCC AACACGATGA 

121921 CGGACTCAGA CCCTGACCCT GATGGCGACC CTTGGGATGA CCCGTGCCTG GATTCGGTGG 

121981 CCGTTTCGGT GTCCGATCCG TGCCCTTGTT CTGCAACCGC CCGCACTTGC GGCCCTGATG 

122 041 TTGTTGGAGC GGGGGGCGCG GGTGCGGCGG CTGGCGCGGC CGTGCCACGC CCCCTGCCGC 

122101 GCCTCCCGGC CTGTTCTCTA GTGCCGCCGC GTCTTCCGCA ATGTCCCCTG CAGGAACTTC 

122161 GCAACCCGTG TCCCGATACT ATGCCGGAAC GATGTTGCCG GCTGCCCCAC ATGTTCGCGG 

122221 TTTACTGCGT GTAAAAAACG GCAGATTAAG TAGATTGCCC TTTCCACGAC CCGCAGCCAC 

122281 CTGAGCGCGC GCACGCAGAA CCTATGTTTA GTGCGATTGC GATTATGTCC GCTAGGTGGC 

122341 GGGCTATTTT GATCCTGCCG CGCACGCATT GACCGTTAGG TGGCGCAGAG CGCCGTTAAC 

1224 01 CGCAGCGAGC ACCGCACGGC GGTTTTAGTT TCGTTACTAT AACGTTTAGT TAAAATCCGT 

1224 61 GGCCGATATT GGGTGGCAGT GTGTAAGTCA CGCGCGCCGC GGCCCCCTTA CCTTGTCGCC 

122521 GGCGAACCGC GGAAAGTCTG TTCCGCCGCG CAGCGCGCGG CCGCTGCCGA AACCAACGCC 

122581 GCTTGAATTT CGCGGGCGGC TGGGCTTGGT GTCAGTGATT CACACAGATG CGATGTGATT 

122641 GGGCGTGGTG GTCAACGTCA CTTGTTAACC CGTAAATCTG GAAATTCTTG TCCGCCGCGC 

122701 CGGTGAGCCG CGGCCCTAAG CCTTATAGTG CTGTGCAGCG ATCCTGAAAC TCGAAAGCTG 

122761 GCCATATAAA CCCCAGTATG GCGTTGCGTG TCGGTGGCAA CCTTTTTGAA AAAGATTTAT 

122 821 TGCCGCCGGG TGTTAAGCAC CAACACCGAC CGTGCGTTTT CAATTATGTC GGGAGGAATT 

122881 ACATTAACGC TGCTGCTGGC GACGCTTGCC ACGGTTCGGT GCGCTCTTCA AACGCACTAT 

122 941 GCGGCGGTCC CCGTGCACTC TACCGCGTCC CTGGGGTGCG TGTTAACGAC ACCCCACGAC 

123 0 01 GTTCTTATCG TTACCTGGCA AAAACAGGAA TCGCCTAGTC CCGTTAACGT GGCCACATAT 
123061 AGTTCCGAAG CGGGCACGGT GGTTCAGCCC CCGTTCGCCG GTAGGGTTGA CATTCCCGAA 
123121 CACAAGTTGA CCAGAACGAC CCTGAAGTTT TTTAATGCCA CCCTGGAGGA CGAGGGGTGC 
123181 TACCTGTGTA TCTTTAACGC GTTTGGAGTG GGAAAGCTGT CGGGAACCGC CTGCTTGACG 
123241 GTTTACGTCC CCCTGTCCAT GTCCGTCACG TTTTACCCCC CGATTAACCC GACGCAGCTC 
123301 GTCTGTCGGG CCGAGGCCAG TCCCGCACCG TCGGTCAACT GGACCGGCGT GCCGCCCGAG 
123361 CTGTGCAGCG AACCTGAAGT GTTTCCCCGG CCCAACGGAA CAACCCTGGT TGTGGGTCGC 
123421 TGCAACGTT^ CGTCGGTGGA CCCCGAAGAC CTTGAGAACG CCACGTGCCT GGTCACTCAT 
1234 81 ATAGGCGGTT TGGCCGCGGC GCGGCCCCTG GACCCCGTGT TTTCGGATCC CCTGGAAGGG 
123541 ACGAGCCACT ACGTGGTGGG TGTGGTGGCA GCGGCCGCCG TTTTAGGCAT TTTTTTAACG 
123601 GGTGTTTTTT TGTATAGGTC TATGTGAGCG CGCGTGTCCC CCGTGTCTAG TGTTTTGTTC 
123661 CCCAGTGAGT GTCTCCATGA CAAATACAAA TTTGAGGCTG GCTTTTTAGG GTGGTTTCTT 
123721 GTGCGACGCT TCCTGTGTAA CTGCATACAC CGGGGTGTCG CCAGGAAACC GCGTCTCCCC 
123781 TTTATGTCCG CTCGCCCTCC CAGAGCGAAA GTGAGAATGG TTCCTGGGGC GTTTTGGCGT 
123841 TGAGAGAGTC GGGCGATGTT GCCGTAGCGG CGTCTGCAAA GGCTCACCCG CTTCTGTTTT 
123 901 TTTTCTTTTT GTCAGACAAC AACATGGACG CCTTGAACAA TAACCTTAAC CTGCTGATGG 

123 961 ATTTTCTGTC TAACTATTCG AATAGCTACA GTAGTTATGA CGACAATATG TCTTACACCT 

124 021 TAGACACGGA ATCCACGCTG TGTCGGCTGA CGGTGGTTTT CCCACCTACC GTTTATGCGA 
124081 TTATATGCTT TTTTATTTTT TGCATTACGC TGTTTGGGAA CGCGTTGGTG CTATATATTT 
124141 T TT TTAAATT TAAAGCGCTC GCCAACTCTG TGGATGTACT GATGGCTGGG TTGTGTTGTA 
1242 01 ACTCCCTGTT TCTGTGCGCG TCGTTTTTGT TCAGCTGGCT GCTGTACGTC GCGCCACAGA 
1242 61 TGCTCACGTC CGCGACGTGC AAGGTGGAAA TCTTTTTCTT TTACCTGTAC ACGTACTTTG 
124321 GCGTGTACAT TGTGGTGTGT ATCAGCCTTA TCAGGTGCCT GTTAGTTGTG TTTTCCCGCC 
124381 GCCCGTGGGT CAAGCACGGG GCCTCCGGCT TTCTCTGCGT GTGTGTGTCT TTAATCGTGG 
124441 CGCTGGCGCT GTCTGCCAAC GCGAGCCTCT ATAGGACGGC CCTGCGTCAC CCAGAGACCA 
124501 GCGAGTGGAT ATGCTACGAA GATGCCGGGG AAGATACCGT CAACTGGAAG CTGAGAATCA 
124561 GAACCACCAG CGCGATCTGC GGGTTTTTGG TTCCGTTTGG GCTGATGGTG CTCTTTTACG 
124 621 GACTTACGTG GTGTATGGTT AAAAGCACGA AGCTGGCCAG AAAGGGAGCC GTTAGGGGTG 
124681 TAATTGTGAC GGTGGTGGTG CTGTTTTTAA TTTTTTGCCT GCCCTATCAC CTGTGCAACT 
124 741 TTTTTGACAC CCTGTTGAGG ACCGGTTTTC TGGCCGAAAC GTGCTACCTC AGGGACGTGA 
124 801 TCAGCGTGGC CATGCACATA TGCTCCCTGC TACAGAGCAT GTATAGCGCG TTCGTGCCAG 
124861 TCGTGTATTC TGGTCTTGGG TCTCTGTTTA GGAGAAGGGT TAGGGATACC TGGTCCGTGT 
124921 TTAGGTGTTT TTCCACTTCA GGTAGTTTAT GAGACACTCA CGCGACACTT GGTTGGATTG 
124 981 TTTGTGTACA TTTATTTTCA TTTTGTGTAC ATTTATTTTC ATTAAAGCGA TCTGACCTGC 
125041 AGACCTTACC TGACGTTTAC TGTCTGTTTC TTATGCACCA GAGGAACAGG GACTGGAAGG 
125101 CCAGCGCCCA CGGGGAAACT GTCATGTCCG CAAGCTCGGG CGGCACGTAC GCCCACTGCC 
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125161 

125221 

125281 

125341 

125401 

125461 

125521 

125581 

125641 

125701 

125761 

125821 

125881 

125941 

126001 

126061 

126121 

126181 

126241 

126301 

126361 

126421 

126481 

126541 

i26601 

126661 

126721 

126781 

126841 

126901 

126961 

127021 

127081 

127141 

127201 

127261 

127321 

127381 

127441 

127501 

127561 

127621 

127681 

127741 

127801 

127861 

127921 

127981 

128041 

128101 

128161 

128221 

128281 

128341 

128401 

128461 

128521 



AGGGGTGGAA 
TGATGGCGGC 
TCGGGTCGGC 
ACTCCATGGC 
GGACGAGGCC 
TTAGCCAGAG 
CGGGGGTTTT 
GGAGCTGGAA 
CCATAAGGCC 
CCACGTAGCT 
GGATGTTCCC 
TGAAGGCGGC 
CTGGCGGCCT 
GGTGGCCCAG 
TGAAGCTGGG 
CCGAAGTGAG 
ACAGACCCTC 
CCTGGCGCAC 
ACACCAGCCA 
CGGCCAGGGC 
ACACCACCAG 
TCGCCTGAAC 
CCTGGGTCAG 
GGTCAGGTGT 
CGTTAAACGG 
ACAGGCACGG 
GTTCCTTGCA 
AACCGACCGA 
TGGTGACCGC 
ATGCCTGTTC 
TGACCCAGGT 
GGCGATATAT 
GGTAATCTGA 
' TGGAGCATCT 
GAAGCTGATC 
GTTCGCCACC 
CGTCGAACCC 
CCGGGTTCCA 
CAACGACGGC 
CCGGTTCCGG 
GAAACTTGTT 
GTTGGCTCAG 
CCAGGGCTCC 
ATAGGGGCCC 
CTGCGGCGCT 
ACCCAAGACA 
GCACGCACGA 
GGACGGTTTT 
GGAGTGTCGC 
GCGTAGTGGC 
CATGGAACAG 
TAAATGAGTC 
AGCTCCTGCG 
GCACGGCCAG 
CTCTCTCGGG 
CGCGCACATC 
CCCGATCGGC 



CGTCACGGCG 
CACGGCCGAG 
GTTGCCCGTG 
GTCGCGCGGG 
CCGAAGAACC 
GCACTCGTGG 
GGGTCCGGGC 
CCCAAACTCC 
CCCATAGCAC 
GTCTTCGATG 
TCCTCTAAGG 
CATCAGCGCC 
CGACGGCGTT 
GTTGTGTTGG 
CCCCCATGAG 
GGTCTGTGAA 
CTGACCCACG 
CTCGTCGACG 
CTGCAGGGGC 
CATCTCGATA 
ACCCCGGGAG 
GTCCGGTAGG 
GTGCGGCACG 
GACGCGTTGC 
CACGGGGGCT 
GGTGCTGCCG 
GGCAAACAGT 
CGCCGTCAGC 
CTCGCAGATG 
TCCATAAGCC 
TTCTGGGCTA 
GGGCACCCGC 
CAGCGGCCGG 
GTCCACGTGT 
GACCACGGAG 
ATCCTCAGGT 
AAAAGTTGCG 
CAGCTCCAGG 
GAAAGGGCAT 
CTCCGTGTCG 
AACTAGGATG 
AATGGTTGGT 
CAGGTGGCCG 
TTCCAAAATC 
CCCCAGGTAA 
GATGATAGGC 
TAACAGCGAT 
AAGAAAGCCA 
TGTTGTGTTA 
GTAAAATCCA 
AATGTTTTGC 
AGCTATGTGG 
GGCCATGACG 
CGGTAATATG 
CAACGGTCTC 
ATATTCTACT 
GACAAACCGT 



GGGTCGACCA 
TTGGCAGACG 
TGGTAGTTCA 
TATCTCACGC 
CTAAACATGA 
TTCCCTGATG 
GTGTCGTGAG 
CCGCAGCATA 
CCAGGGTCGT 
CCCACGGTTC 
TCGTCGGTGG 
CTGGGAAGCG 
GCAAACAGAG 
TTGCACCCGT 
TACCACTGTT 
TAGACTTCAT 
GTGCCGCACA 
TGCTGGGGCT 
GTTTCCTCTG 
GCGGTAACCA 
AGCAGGGCCT 
GTGGGACTCG 
GGGAAGGACA 
CTGGACGACT 
ACCGTCTGAT 
GCCGACGTGA 
GTGTGCTGCA 
GTGATGTCGC 
GCGTATCTCG 
ACGCACTGTC 
ACCAGGCGTT 
GCCATTCTGT 
CCCAGGGGTC 
CGCGTGACAA 
GCCAGGCGGA 
TCTTCTGGGC 
ATGACTTCGG 
TCGTTCACGA 
CCGCAGATGC 
CGCGGGAGGA 
TCGTTCACGG 
GGAAGCTGAG 
ATGACAGAGG 
TCGTAAGCCA 
AGCGAGCGCT 
TGACTGAATA 
GTGGTGGTAA 
CCGAGGACAG 
ATAGCTCTGT 
CACAGGGCCG 
AGCGCACCTG 
GTTACGTTTT 
GCGTATCTGG 
TTCCGTTGTA 
CAGGCGGTGG 
AGTGTGCCAA 
CTCGTCTGGC 



GGGAGGCCAG 

GGTTGCGTGG 

GGGCGATGCG 

CCAGGTACCG 

TGCTGATGGT 

CGTTCTCCTC 

GGGTCTCTCT 

GTGAAAATGT 

TGAGAAGACC 

CACCGATGGT 

AGACGGCCGC 

GGGCACCGGG 

TCAGTTCGCT 

AGTCCTTTCT 

CGTCCGAGAA 

CGTTGTTGTG 

CGGTGGCCCG 

GGATTTCGAA 

ACGGGATGCG 

CCATGCCTCC 

CTGTTAGCAT 

GCAAGAGACA 

CCAGCACGAT 

TCACCTCGGC 

GCTGTTGGGT 

TGGCGAAGTT 

GGAGCCAGGC 

TTAGACGCCG 

CGCCCACTGC 

CGACCCACCG 

CCGTCGCGTC 

CCGTAAACAC 

CCACTTCGCA 

ACTCTTTGCT 

ATAGGCCCCA 

GCACCAGCCA 

CCGATGTTGG 

ACTGAATTCC 

GTGCGGCCCT 

CGGCAAAGAC 

TTTCCCGTGA 

AGACAAAGAG 

CGGGTTCATG 

GCGACAACGT 

GGTTGCCATC 

AAGACCTCCT 

TAACGTTCGC 

GCGTGCCGGT 

GGCGTGCGCG 

ATCTGATTTC 

CACCTGTCTG 

CGCACGGGAG 

ACACCGTATA 

ACGGGAGCAT 

ATTGCAATGT 

GTGTCAGGGC 

AGCTCAGATG 



GTGGCGCCCG 

GTAGTGCTGC 

CTGCTGCTGG 

GCCGTTGGCC 

GGTCTGGGGG 

CAGGTGGATG 

AAGAAGACCG 

ATCCGCTCGG 

CATGATCGCG 

CAGACGGGAG 

GACGTCGAAC 

GGTGACCAAG 

GTTTCTGCAA 

GTAGAGTTCC 

CGAGGTCCAG 

TGAGATGACG 

GCAGTCAAAG 

GATGACGCCC 

AATCTGTAGG 

GTCGCTGACG 

GAAGAGGTTG 

CAGGTGCTCA 

GAGGTTACCG 

GGTGGCCGTG 

GGCGCTGATC 

GATGCTCAGG 

CTGGTCGTCC 

CACGTGCGCT 

CGCGTTGATG 

GTTCCTGCCC 

CTGCCTGGTG 

CGACGTGTGG 

CTGCTGAGCC 

GCCAACCGTC 

GTCGGTCGGC 

GTTGGAGGAA 

CTGTCTGATC 

AAGCTCTTCG 

CCTGATGGCA 

CACGCAGGCA 

GATGTCTGGC 

GCGCAGGCCG 

GGCGCGATTA 

TTGGGTGATG 

GCCGTCAACG 

GAGGAATGTG 

CTCGCTGGTG 

GCGCGCCAGC 

AGAAAATGCG 

GCCCCCGGGC 

AATGGCGCTG 

GCGCTGGAAA 

GCTGTTGCCG 

CACCGCGGCG 

CACCGGGAGC 

GGCCAAGGTG 

TCTGCCCACC 



TCCGGTGACG 

GCGAACATCC 

TTGAGATGGT 

CACCCTGGGA 

ATGTGGAGGT 

TCCCACTGGT 

AGCGCCCCCA 

CGGAAGAAGG 

CATCGGGCCC 

AATCCCCGGA 

CCGACGTTGG 

GCGGCCACTG 

ACCTCGGCGA 

TGCGCGGGCG 

TTTGCCGCCA 

ATTCTTTCTG 

TTTTGGCACG 

GGGGTTTCGG 

CCTCGGTTAC 

TCGTGGCCGG 

GCCAGGTGTG 

AAGGTCGATC 

GTGGCCTTCA 

ATGACGACTG 

AGTTCTTCCG 

TCCCTGCATA 

TCCGGGTTCC 

AGCATGATGT 

CCCATCTTGT 

CTGCCCTGGG 

ATCGGCTGGG 

TTGACAATGT 

ACGAGTCCGT 

GGACATCGCA 

GAGGGTGCCT 

ACGGGCTGCT 

GGCCACGCTC 

CAGGTTTCCC 

TCCAACGGGC 

GACACGTTGA 

GTGGCGGTCA 

CCCCGCGAGA 

ATGCTGGTAA 

CGGGTAATAC 

GGTTCAAAAG 

TATGCTTGAG 

GTGGCGCTGT 

TGCTGGCAGT 

CCCACTGACG 

AGCAGCGTGG 

TGCAGGTGTG 

ACGCAGCTGT 

TTGGAGGCCT 

CATATTGGGT 

TGGCGCTGCA 

CCCGGGGGTA 

TCTAACCGCA 
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12 8581 ACGTTCTATC GAGGCCGCTG TCTCTAAGGC ACTGCACTAT TTCGCTAGAT CTTGTGGTAG 

12 8641 GTTGGCGCAT CAGGTCTGGG CCATAGGGCA GGGATATAGT ATGATGGCCC TGGAGGTGCG 

1287 01 CCGACCGAGT TTCCAGAGGT GCATAATTAA CCAGAGGCGC CAGAAGATGC CAGATAACGC 

128761 CCAGCGCTGG GTCGTCATCG TCGTTGTCAA AGTTATCAGA GGGGCGTGTT ACAAGCCAAA 

12 8 821 GTAAAAATTC ACTAAAGCAC TGTTCTGGCA CAAGCGCCAA AACACTTTCT GCGTCCGAAT 

12 8 881 TGTCATGAAT AAAAGCCTCC TCTTCTGGAC TAAGGGCTGC AGCGGCCCAG CGAGGGTTCG 

12 8 941 TGCGTTGGGC CATATTTGCT AAGTCTAGCC GGCGCAAAAG CAAGCGGGTG GGACTTTAAT 

12 9001 AG<SC^GTATA GGGGTTGTTA GGTATGCGGT GGGGAGCCGT CCATGTTACA TGGACTTCAA 

129061 TATTAATTTA TGTTAGTTTT GGGCACCTTG GGATGTTACA CTTATAACTT CCCCCATGTC 

129121 ACAGTAATCA CTTGACAGAT CCCCTTCTAA CTCATAACAA CTGTTTCGGG GTGTTGAGGT 

12 9181 TACAGGTAAG GGTGGGAGGG GAATGTTAGG TGGCGAACCA AAGCATAACG TCGCGGGGCG 

129241 GGAGTGGGAG CCTGTAACCC CAGAGATGTC ACTTACGCTT GTTAGGCGAT CGACAACTAC 

12 93 01 CTCCGAAAGT GTGTCCTGTT GGATGTTTTG TGGATTAGGT AAAACAGAAA GTTGTGGAGA 

12 9361 AAGTAACCTT ATAGATGACC TCCTGCTGGC GCGTGGCTGG GTGACAGCTG GTTGTTGATG 

12 9421 AATGAACACT TCCTCGGGCG TTCCTGGGGT GGTGGGTGTG GTCTCACTAA GGACGGCCAA 

12 94 81 CTCAACACTA ACTTCCTCCC GTTCAGACCG TGAGCGGTTT CGTCTGTAGC GTCTTGTGGA 

12 9541 GGGGGTTGCC TGAACCCTTG TTGGTGTAAA CAATCCAGGT AACCGGTAAA TGCACGCGGT 

12 9601 CAGGGCGATT TGTAACAAAC CTAAAAACGC ATAATAAGCT TACAAACGCA TTTTTGGTAA 

12 9661 CAGTTTGTAT AATTTAGCCC CAAAGTTTAT ACAAAGCGAA ACTACTTACC GATTAGTAAA 

12 9721 ATGCAAACCG CAAGAAGGAT AACACTCACT GATTCTGTTC CAATTGGTTC GAATTTAAGA 

12 97 81 ATGGAGAGTA AATTTGCCCC GCTGAGAATG CCTGGAAATA AGAGGCGCGT ATTTTACATG 

12 9841 TGGTACAGGG TAGAAAACCT TGTAGGTTTC AGTTGAGTCA CCCTGCCACG TAAACCCGCC 

12 9901 TAGAGTGCTG CACGCGGTGT TGGTTGCAGG AAGTACATCC TAATTTTAAC TCTAAAAGGG 

12 9961 GTTCCTTGTA AAATTATTTA AGTTATGGTG TACTTACATA ATATCCAATG AGTAGTTACA 

13 0 021 CTCCAAAAAG ATACTCGATA AAATGCGTGA TATGTGAAGT TTGCAACAAA TGCAGCGGTG 
13 0 0 81 AGTAGGCCCA AGAAAATGAT AAACCCTATA TGAGGAAATT TTATTAGTAC GTTTGTGTAA 
13 0141 CTGCAAAATA AAATTTTAAA TAAATACTTT ATATTCATGT TACCAAGTTT AATTTGTGGT 
13 02 01 GGTTGTTCTT GAATTATATG TACTCCTAGT TCTGAGGCAA CTGTATGTAG TGCAAGCACG 
13 0261 CTGCCTTGCA ACATAACCAA TAGGCATGTT CCATGATTCG ATATTCGCCA TGCCATTTTA 
130321 AGTGCCACGC AGATTGCAGC AATAGCTATA AAAAAACAAA AGTATGTTAA TAGTCTATAA 
13 03 81 AGTATT^AAAC ATTAAAACAA AATTAAAAAA TTAAATACCT GTATTTGCCG TAACTATAGA 
13 0441 GTATGATAAT AGTAAGGTGT AAGACTGTGT TGTTGTAGTT TCTGTTAATG CATAAAGTCC 
13 0501 CACAAGTGCT AAAATAAAAA AATAAATGTT AGTTATTTAA TTTTTAAAAG TTTATACTAT 
13 0561 TCAAGTAATT CAAAGTTGTG TCATACCTGC GTAAATTGCA AACCAAGATC CAATTATTCT 
13 0621 TAGTATGTTT GTTTTTCCAC ATAGCCACAC ATAACCATAA AACACAATGG GCATTAAAAT 
13 0681 TACTGGCAAC AGCGCAGATA CTTTGTAGTT TGGAGTCCAC ATAAGAAATG CCGTAAGAAT 
13 0741 AACAGTGCCT ATCCATGTTG GTCCCACAAG AAGACATGAT GTTGTTTTAC TAGAAACAAT 
13 08 01 TTTCGGTGGT TTTATAACAA ATATTAATAT GCCTGAAAAA TATAAACAAA CGATTAATTT 
13 0 861 AAGGCCAATA TCATAGTAAA ACAAACTTTA AACAGGTAAA AAGTTACGTC TTACCTATGT 
13 0 921 AAAATTGAGA TACGCATGCA ATAGCACTAC ACGTCTGTAC CAAAATACTT GCGTTGTCCA 
13 0981 AGGCCAATGC CGATAGTAAA ACACAACATG CTGCAAACAA GTACACAAAA TATAAGTTTA 
131041 ATTTGCTAAT TAAAATTAAT ATTTCAAATG TATTATAAAA TCATACCTGT AATCCATGGG 
131101 AAAAAAATTG AAGGCAACGG AGTTAATTGA CCTGCTCTGC TTCGATTACA ATACAAACTA 
131161 ATAGTTTTCC ATAAAAGAGA ACAACACGTA ACAAATGAAC AAATAATTGT GACTGCTTGA 
131221 TTAGAAGTGT CCCTGGCATG TTGTACCAAG GCTATTATCC AAAACGTTAA TGTAAGAAGG 
1312 81 TTTGTAAGTA ACCAACTACT CCAAATAAAA AAGGTATAGC TACACGCCAT TTTAAAATAA 
131341 TCAAAACTGT ATTATCAAGT TAAGGGGGAG TTTGAGCTTT AAAAGGTTGG TTACATCGTT 
1314 01 TCCAGGCAAA CCTCACACTG AGTCACACAA AGGTTTCTGT GTGCCCCTGT ACGGGAATCT 
1314 61 GCCTTATTGC TGATTTTGTT GGCAACCAAC CAAAGAAATC ATGGAAAAGC AGAGTGGGAA 
131521 TATTTGTTTT TGTTCGTTGG CCTGGATGCT TGGAAACATT CCTGTTATTT TGATGTTAGG 
131581 CCAAACTGTG GAAAACATTA AGAGTTGTTA TTGAAAATAT TTTTTGGTAA CAAAACCATG 
131641 TTTTAGGTCC CAAAAGGAAC ATAAAAATGT TTAATAGAAA CATGCCTAGT ACAGAACATT 
131701 TTGTTTGGTA CGTAAACTAA AAATATGTAT GGTAAGGGCT AGGGGTAAGG GCAAGGGGTA 
131761 AGGGCTAGGG GTAAGGGCAA GGGGTAAGGG CAAGGGGTAA GGGCAAGGGG TAAGGGCAAG 
131821 GGGTAAGGGC AAGGGGTAAG GGCAAGGGGT AAGGGCAAGG GGTAAGGGCA AGGGGTAAGG 
131881 GCTAGGGGTA AGGGCTAGGG GTAAGGGCTA GGGGTAAGGG CTAGGGTGAG CTAGGGTGAG 
131941 CTAGGGTTAT AGTTATAGTG TAAAATAATA GGCCTACGAT AAAATGTCAA CATATATTTT 
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132 001 TTAATTTGTG TTTCATTAAC AGCCATGGTG TATTAGAGTA AAGGTTAACG CTTCAATACA 
13 2 061 ATATACAAGT AAAAGTAAAC CACACGAATT TATAACATAT TTACAAAAGC GAACCAAGCT 
132121 GGCACATGTA ACTTCACGCT ACCCTAAACG TACAATACTG GGACTAGAAC CCAGAGGTAG 
132181 TTAGAATATA CGGTAGTTAC AGAACTTTGC AGTTCCCTTA GGCCAGCAGG GCTCTGCGGT 
13 2241 TAATTAAACA AAGTTTAAAG ATAACTGAAA CTTTAGGAAG TGCGTATGGG TGCAATGTGT 
13 2 3 01 TCCAAATAGG GCAAGGGTTA CATAAACTGT TGCCTAGCGG CCGGGCCCGG AGGCGCCCGG 
1323 61 CCGGCGCCGC CGCCGGGCCG CGGCCGCCAT CTTGCGCCCG GGGCGAGGGT CCCCCGCGCG 
13 2421 CCCCCCGGGC CCGCGCGCCG GGCGCCGCCG GCCTCCCCCG GCTCCCGGCC CTCCGCCCCG 
13 24 81 CCGGCCCGGG CCCGCGGCCG CCGCCCCCGG CCGCCCCGCG CCGCGCGCCG GGCCGCTTTC 
132541 GGTTCGCGGG GCCGGGGGTC CCGCGGGGGG CCGGGGCCGC CCCCGGGTGC CGCTCCGCCG 
132601 GGCCCGGCCG ACTCCCGGGA GCGCCCCGGT CCGGCCGAGC CCGGCGGCGC CGCGCCCCGA 
132661 CGCCCCCCGG GCCCGGGGCC CCACAAGCCG CGGCGCGCGG CTCCCGATGC CGGGCGGCCG 
132721 CCGCCCGGCA TGGCGGTCCT CCGCCGGCCT CCCCTCCCCC ACGCCGCCCC GAAAGGTGGT 
132781 CTCCGCGCCG CCGGGAGGGG GGCCGGGGCC CGGGGCGCCT CGGCGGGGCC CGGCGCGGGG 
132 841 CGCGACCGAG GGCCCCGGGA GAACGGGGGA TCGGGAAAAC GCGAGGGGAG CGGGGGACAG 

132 901 GGGACGGCGT GTGCGTGCTT GTGAGACACC GGGTACGGCT GCCTGCCTGC TCGCTGGCCT 
132961 GCTTGCTGAG GGGACAGTAG GCCTGCTTGC TCGCTGGCCT GCTTGCTGAG GGGACAGTAG 
13 3 021 GCCTGCTTGC TGAGGGGACA GTAGGCCTGC TTGCTCGCTG GCCTGCTTGC TGAGGGGACA 
13 3 081 GTAGGGCTGC TGGCTTGCTA GTAGGGCTGC TCGCTGGCCT GCTTGCTTGC TCGCTGGCCT 
13 3141 GCTTGCTTGC TCGCTGGCCT GCTTGCTTGC TCGCTGGCCT GCTTGCTTGC TCGCTGGCCT 
13 32 01 GCTTGCTTGC TCGCTGGCCT GCTTGCTTGC TCGCTGGCCT GCTTGCTTGC TCGCTGGCCT 
13 32 61 GCTTGCTGAG GGGACAGTAG GCCTGCTTGC TTGCTAAGGG GACGGTACGC CTGCCTGATG 
13 3 321 GCTTGATAGT AGGGCTGCTG GGCTGCTAGT AGGGCTGCTG GGCTGCTAGT AGGGCTGCTG 
1333 81 GGCTGCTAGT AGGGCTGCTG GGCTGCTAGT AGGGCTGCTG GGCTGCTAGT AGGGCTGCTG 
133441 GGCTGCTAGT AGGGCTGCTG GGCTGCTAGT AGGGCTGCTG GGCTGCTAGT AGGGCTGCTG 
1335 01 GGCTGCTAGT AGGGCTGCTG GGCTGCTAGT AGGGCTGCCT GCTGGCTTGC TTGCTTGCTT 
133561 GCTAGTGGGG CCGCTTGCCT GCTACTAGGG CTGCTGTGCA GCTGGGAGAA CAGAGTAGGG 
13 3 621 CTGCCGGCCA GCTGCGTGCG AGGGCGTCCG AGGGCCAGAC GAGGACACGG GACCCGGGCC 

133 681 TCTCCCCCGC CCGGACCGCC GGGCACCCGG CCCAGATCT 

SEQ ID NO 2 

CDS nucleotides 1353-2624 
SEQ ID NO 3 

MFVLVLFMLLQPVSVELLPAKLTSVPTWCPPHPGDTyLLTCRGTSTARDQRSTQWFRNNTI.MRGSNFYGRL^^ 

SDRYACQTKTTTRSNNIDFRVSSSRLTLQERCSSYGYTYANNTRVLRCYSGGNVTLRNVVFHLNGTAVINGTTTNIHTFV 

LTEKTGGTYFCSAFIGNEKFYSQTINVFFTSFTFKPTNDIPNESHFNKTGQIQQTASVQHPENYWFSVPVFSIGVLTGI 

AISLIMCWLFTIRCNENSESSTNSYASQTSYIQPSHNQRSNTNECSRHTYRNAHQEESIEELPNQHTSETDSCCQLVLLE 

VKNVAYDGPQENTINEVMEQYDDVVVKNIEQTSYEDNVEHMDYSDTINPNFNYYSGLIIiEEVDEVFW^ 

NLDHNEYNHLNELNMIEQYDWIiE 



SEQ ID NO 4 

CDS complement (2692 3258) 

/note="dihydrofolate reductase; ORF 2; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 2" 

SEQ ID NO 5 

MDITVNCIVAVDEQLGIGKNGTMPWPYLRNEMMYFQKMTSTPSWGEKWVIMGKRTWFSIPEKra 
REPPHGAHFI^TLDDAFNFYRQYKLKEQLNTVWVIGGKSVYESVLNYKCPLK1*YITRIMESFDCDVFFPSINFT 

SE I PGKDTNFEENG I KYKFQVYEKNFNK 

SEQ ID NO 6 CDS 3676.-5613 

/note= "complement binding protein; ORF 4; similar to 
Kaposi's sarcoma-associated herpesvirus ORF 4" 

SEQ ID NO 7 /translation="MTFKLFPLFLLHAIMYVHCDENCKPPHFTEYRVKSNTEKDLySV 

GETAEIilCRPGYVTNTKIITTECLQNGTWSTPNFPCDRKRCPTPADLLNGAVHIHGGD 
NALKFGSNISYECNEGYDLIGSbAmFCILQDTENVNWDSNEPVCEIQKCIKPPAVEHG 
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DYLPNQDVYNYGDAITFKCSLSYTLVGSTTLVCTSNKKWSNSFPTCLMLVCESPQIDN 
GYIDIGLSRRYNHGQSITVKCSDGYNIVGPETLTCTNTTVA^PIiPKCVLVrmJPSTPM 
PETPMPETPTPDYQKINLSTAKTATTPNAFVTTWSPEKDDVTCVKPHFERFMVKAEN 
DKEKYSVGASVELICRPGFTKMQSTVSVECLSNGTWTAPNAKCHRKKCPTPQELLNGE 

yivtsgedafkygtnitykcnegyqllgsmvricmlkddlktvdwepkapicdiekck 
pppqitngkyhpvkdfyqyij:)tvtfscnrdfslvgdemttcisntwnkpfprceqit 

SAPNIAHGKLLTGSSSVYKYGQSVTIGCETGFTLIGSEISTCKDSSWDPPLPTCVPAV 

smpsdtpkpetkkpntptpeapkpntpnvgthtpfkpppqnppiappmskwkrhwlv 

LFASVASLLFVLAAIiYCCFLK" 



CDS 6045 . . 9443 

/note="ssDNA binding protein; ORF 6; similar to Kaposi's 

sarcoma-associated herpesvirus ORF 6" 
/translation="MASKGNAGQPLEDNQGSRAPIGACGYVYAYSKQDFPFAEASILG 

NRPSGSGVFSLPILYGLTVEHEFPLTVKAAYKKVDTTTIJ^VKVTCFHREVIVFHNASL 

FRPVFDGTGIiNELCEEARALFGYTQFIEPGPPHSIWNPLECPQIiPDKDEMFLGWVTE 

gfkerlwrgclvpavfqtqqvqiagrqafkvplydedlfaphghrmprfyhkdvsayl 

YDSLFTSIAQALRLKDVTAVIHATEKQFMQDHYKIAKIVQAKQFSTTLPKTTDGSSHM 
IVDSWAEIALSYGCMFLECPQDACELLNYDSWPIFDGCDSPEARVNALERWSAEQAV 

hvagqlfaansvlyltkvqkqaprgqkgdvnvynsfflqhglgflneatikengseaf 

KGVPSNALDGSSFTPYHLAYAASFSPHLIJUaiCYYMQFLQHHKSSTNQAFNMVHYVGT 

aansemctlchgntpatclntlfyrlkdrfpavttpqrrdpywtgtagtfndleilg 
nfasfrdreedgnpadehpkytywqlcqtvteklsaigitedhdnhvnlttniqsflr 
vfkgidsivdgevmkfwsmiknnfnfrehvksvhhilqfccnvywqapcavflnlyy 

KS LLW 1 1 QD I CliP YCM lY EQDNP AMGI LPSE WLKMHFQTLWTNFKAACLDRGVLTGCE 

lkxvhrdmfcdffdtdagsnglmapfkmqvriarammwpksikiknriifsntagse 
avqsgfvkptgtrdtywagpymkfijjslhrai,fpdtktaalylwhkisqtnktpvlk 

DVPDDELAELVSYVKTNSIAFEETNVLDWPDSLMSYARIKIiNGAILRACGQIQFYAT 

TIiHCLTPVLQTIDAEEYPHVLGSAAIATPVAYLAEIRGRTALTVQTTARQPVAATGRL, 

RPVITVPMVWKYTGWGNNNVFHCGNLGYFAGRGVDRNLWPESSPFKJCTG 

RHVMMTPIIDRLIKRAAGQTISTFEAESVKRSVQALLEDKDNPNLLKSVILELIRHLG 

KGCQDLSSEDVQYYLGDYCMLTDEVLFTLDNIAQSGVPWTIEDAGALIEDRQDADDLQ 

FVDSDDIATASCQPPEEQLPTPSAGALL-AGKKRKINALLSDLDL" 

)S 9468 , . 11528 

/note= "transport protein; ORF 7; similar to Kaposi's 

sarcoma-associated herpesvirus ORF 7" 
/translation="MARELAALYAQLSALAVDLSLVIFADPRSIDGARILKTKTQIEN 

LNRDLLPIiLREQNSVETSSLSLEVEHIAKNIEDKLGELERSIiRQRYSSREHFETLHLR 
PECHYHSTVTFQFYGGGLIDVNMCLINDVELLCKRLGSVFYCIGANEALSGLNRVLTF 
LSTLRGISPIPHPDLYVTSVPCVQCLREIELVPNQGSSLLAVLADRHCDHLCKKVRAE 
PIHGLFETELSQLGLKVTKRSDATQHGVRSSADQLRESSLAAIQDHNIFKRVSASIME 
LSNLIYWNAGQTGLQTGTENECSQMARLLTHEADMHEHRALITPKLSATHFYDCFRPD 
PIESLFCGGLFNSIDDTIN;a,SRDCSVTFFQQANYTNVMRKQNELFTRLNSILRQGSA 
GS QKP ATP S E PRTTTVAATAASDVI KD AQYRKEQYMKKVARDGFKKLTECLQTQS AVL 
ANALCMRVWGGVAYGEASELVNHFLLRRRFVALPWEARCRSDQILFENSKYIKNSLYS 
QRLSREHVEIITLQFYGLITGPLTRQSDLFPGPANVALAQCFEAAGMLPHHKMLVSEM 
IWPQIQPKDWIDQTFNRFYQLPEGDLNAVQKSAWCFIRELVLSVALYNRTWEKTLRIF 
SLAREKLSISNLDVKGLTSGLYLTYEQDAPLVLISQNTGWIFKDLYALLYHHL.QLSDG 



HDDN 



CDS 



11515, .14004 
/note= "glycoprotein B; ORF 8; similar to Kaposi 
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PHIFKVRRYRKVATSVTVYRGWTETAVTGKQEVIRPVPQYEINHMDTTYQCFSSMRVN 
VNGIVNTYTDRDFTNQTVFLQPVEGLTDNIQRYFSQPVLYTTPGWFPGIYRVRTTVNC 
EIVDMIARSAEPYSYFVTALGDTVEVSPFCHNDSTCSVAEKTENGLGARVLTNYTIVD 
FATRQPTTETRVFADSGEYTVSWKAEDPKSAVCALTIiWKTFPRAlQTTHEASYHFVAN 
D\rrATFTSPLSQVTNFTGTYPClJroVIQKTLNATIKKLSDTHATNGSEQYYETEGGLF 
LLWQPLTPLSLADEMRELNGTTPAPPTTTSTANRVRRSVGTNEQATDDLAAPQLQFAY 
DKLRASINKVLEELSRAWCREQVRDTYMWYELSKINPTSVMTAIYGRPVSAKFVGDAI 
S\n:DCVAVDQASVSIHKSLRTSTPGICYSRPPVTFRFLNSTTLFKGQLGPRNEIIL.TD 
NQVEACKETCEHYFIASNVTYYYKDYVFVKKINTSEISTLGTFIALNLSFIENIDFRV 
lELYSRAEKKLSGSVFDIETMFREYNYYTQRIAGLREDLDNTIDLNRDRIARDLSEIV 
ADLGDVGRTVVNVASSVITLFGSIVSGFINFIKSPFGGMX.MILVIVAVVLIVFALNRR 
TNAIAQAPIRMIYPDIDKMQPSGGKVDQEQIKNILAGMHQIiQQEERRRLDEQQRSAPS 
LFRRASDGLKRRFRG YKPLENEEAQE YEMS K " 

SEQ ID NO 14 CDS 14122 17166 

/note="DNA polymerase; ORF 9; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 9" 

SEQ ID NO 15 /translation="MDFFNPYLGPRGPRPHSHRGTDAPAPAGAGAVQPPPDVCRLIPA 

CLRTPGAGGMIPVTIPFPPTYFENGARGDVLLiANERSMWTARDRKPVAPDPQPQSITF 
HAYDVVETTYAADRCAEVPSRFQTDI I PSGTVIiKLLGRTEDGTSVCVNVFRQQVYF YA 
KVPAGINVTHILQQALKNTAGRAACGFSTRRVNKRILKTYDVAEHPVTEITLSSGSML 
STLSDRLVACGCEVFESNVDAVRRFVLDHGFTTFGWYSCARATPRLAARDARTALEFD 
CSWEDLSVQADRSDWPPYRIVAFDIECTGEAGFPCATRDGDAVIQISCVFYTTREGAP 
NPPNILFSVGTCDPIPDTDVLEFPSEYDMliVSFFAMIRDFEVDFLTGYNISNFDLPYIi 
ITRASQVYKLRLNEYTKIKTGSIFEVHEPRGGGGGFMRSVSKIKIAGIVPIDMYQVCR 
EKLSLSDYKLDTVARQCLGGKKEDVSYKDIPPIiFRSGPGGRAKVGSYCVMDSVLVMDL 
LKMFMIHVEISEIAKLAKIQARRVLTDGCX?LRVFSCLLEAAARENFILPVPTPEGQGG 
YQGATVINPIPGFYDEPVLWDFASLYPSIIQAHNIiCYSTMIHGRDLHLHPNLTPDDY 
ETFVIiSGGPVHFVKKHKRESLLGRLLTVWLEKRRAIRRTLAACDDPSLKTILDKQQLA 
IKVTCNAVYGFTGVASGLLPCINIAETVTLRGRTMLEMSKSYVEALTTEDLRTRLGRE 
VTARHGARFRVVYGDTDSLFIACDGYSAEAVSAFCDDLAARITADLFPPPIKLEAEKT 
FKCLLLLTKKRYIGVLLNDKMVMKGVDLIRKTACKFVQERCRAILDLVLHDPEVKAAA 
RliLCKRPPHAVYEEGLPAGFIKIVEVLNASYLDLRNSWPIEQLTFSTEIiSRPVCDYK 
TTNLPHLAVYQKLASRCEELPQVHDRI PYVFVDAPGSLKSDLAEHPDYVRQHQI PVAV 
DLYFDKLVHGAANILQCLFGNNADTTVAILYNFLNVPYKliFS" 

SEQ ID NO 16 CDS 17261.. 18511 

/note= "unknown; ORF 10; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 10" 

SEQ ID NO 17 /translations "MLVNELSWLGDWEVTFHRGRFSFVNLTRLQTFKGHGGYARVRL 

PFSLDQL.LHQHFAFGLVTRLKELPPFSDCVALIAPLDSGGDADAARVAPGFVLDSSRP 
LTVWVNASGRHTIRFCLLFLKPIDLERAVTYVFGENGGARSEGTPKPTCATESLPGGP 
LRVSGEASQTSPHSFVAYFPTANSVACLSLLRLQVRPFSDDAAHRDARISPKYVTFSN 
SGGNVCKASVHTLSPSRCKTAQMEIIYAPGDPNAEXVLGQSGPVLPTHTGGRVLGVYA 
DAEKTIQPGSSAEVRVQLIFQQGAAARGDI^FLVTGVAPEPLFVVTPALLLSGCTTHL 
RLFNPNGTPTTIKRDTLVAAAAPCPWRLS S ADDAPRDLVAS PDTGALS INAFTIPVG 
FPGWSAECHVSLRDNGVHERMNH " 

SEQ ID NO 18 CDS 18520.. 19749 

/no te= "unknown; ORF 11; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 11" 

SEQ ID NO 19 /translation="MGTPVRFFRGEWQTSSLVDNGTPRYSSLVWAATIHDGYLTLVNR 

SELCVTERSPCLPACPSIGRLVGKRFPGFAFASATLGDRGTRTVFYAFGHRDNPLDIV 
PAVVERADRELVLRVHAPQTTRVSRYGLKVFVAIVTVVRPPGVFLHFPQDRVPIALTD 
ACSQEGSRLTSEEPWIKIQGFPVLSDETAHPFLLTQKTKPFTERKFCRLIMDNDQRSA 
VNTVYLGKQHVRVTVTRPPETIVTDGPVTATLSLTGNAPIAFRHNPYFELPWSSTTAI 
FTPWYVGLTVCIPPNCSKFVRYGNTYVSAFNRKLTAIISNHAHNGGFRIQDCEWPPN 
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REIEILVTNVSQAPVYISTGTQL.GQAIFVFAPRFGGPAKLRQLLGHRSRALELPGGVT 
VDSQKLCRFETMYIiFST " 

SEQ ID NO 20 CDS complement (19921 .- 20544 ) 

/note="R2; similar to IL-6" 
T-n no 21 /translation="MFPVWFVLFYLSCWAASPTLAPPPTAAGINVLPQWAGNRASLDR 

SEQ wu ^ T^RGj^SEVGLNIQRWFVYLCHHSTLCRVREYPRIMSFVHFPILMSNVECQRREFRGAE 

CMNAMVRGLRAYESYLTRLRMLLDDAPGDADAAAIGSAVTVVLSALDSLIEELPVNNK 
IGGAESNEKTVRALGGQSPRDWLSAFRILEYLQMFLRDGRRAIAMM" 

SEO ID NO 22 CDS complement (20777 .. 21778) 

" /note="thymidylate synthase; ORF 70; similar to Kaposi's 

sarcoma-associated herpesvirus ORF 70" 
<!BO TD NO 23 /translation="MIVLVHLGICYVKKIIPVCVAGIAAARLRVFSAPEGAAAVRCAC 
SEQ ID HO RGDHGELQYLAHLDLIIKHGVQREDRTGVGTRSVFGLQARYNLRDEFPLLTTKRVFWR 

GWEELLWFIRGSTDSTELSRRGVKIWDAHGSRAFLAAQGFGDRREGDLGPVYGFQWR 
HFGAEYRGADANYEGQGVDQLRYWDLINRRPHDRRIVMCAWNPADLARMALPPCHVX. 
CQFYVARGELSCQLYQRSADMGLGVPFNIASYAI.LTYLIAHVTGLTPGDFVHTLGDAH 
VYNNHVDPLLLQLRRTPRPFPRLKILRKVARLEDFTRADLSLEGYDPHPHIEMEMAV" 

SEQ ID NO 24 CDS complement (22245 .. 22592 ) 

/note="R3; similar to Kaposi's sarcoma-assocxated 

herpesvirus K4 viral MIP" 

cr.^ -rr, «n /translation= '•MRGLFVCVFFAVFACWDYAFPMGSMSGPAPELCCLGYVTHLPP 

SEQ ID NO pQLWSYSHTSSQCSVDAVlLNTRRGKKLCANPGDDAVKKLLQAVDKRPKKGRRTRRS 

LIDDSEEGLGSGI " 

SEO ID NO 26 CDS 26846.. 27409 

/note="Bcl-2 homolog; ORF 16; similar to Kaposi's 

sarcoma-associated herpesvirus ORF 16" 
SEO ID NO 27 /translation="MAAVQGPPPPPEEENENSLPVDVYAIEGIFLYCGLGQAEYLHHP 
SEQ «u ^/ ^^^^^T^^oTVT^T VircT.T.cwT^VTJffT.RG-LNAIGOGMIiOINTDGRHNWGRAIi; 



SEO ID NO 28 CDS complement (27515. .29125) 

^ /note="capsid protein; ORF 17; similar to Kaposi's 

sarcoma-associated herpesvirus ORF 17" 
SEO ID NO 29 /translation="MTPVYVGGYVDWSLPKIEKELYLEPSIVATI.LPYTDPLPINIE 
SEQ ID NO 2 y 'jjvPEAHVGHTIGLFQVTHGIFCLGKLTSHDFLALASRLAGDSRAAQIQLNPMPRDPLL 

EMLHTWLPELSLSSLHPEELQDPNHPPAFQHVSLCALGRRRGSIAVYGPDPTWWSKF 
DSLTREEAGKITVNCLDLCERQVTPPEFAAPLETLr4AKAIDAGFIRDRTDLLKTOKGV 
ARVARSTYLKASQFPCAQHCGNRDTRTMSALPEDNITIPKSTFLTMVQSSLDNMRNQG 
HRTYVSAPPSMPATAAYPSWIPPPELTVPSYAPPVAPPFPFQSAFAPQPSPYAATYYS 
PTYGYAPAPSRHQKRKRDVELSDEPVFPGEEVGIHKDVMALSKNILDIQADLRDLKRA 
ASQTSGAQDADQRPQPPPVQFSWPQTYASAPYLAYQPQWYSGTDTHLHAPQPYQSAQG 
IQQTQPPPPQPASHHAGLATQPATPAPAAQESVMSNAIPSASAPRAGACPPLDPECGQ 

SARAPVEASAQPAPVSQIQKMFCEEL.LK" 
SEQ ID NO 30 CDS 28998. .29897 

/note= "unknown; ORF 18; similar to Kaposi s 
sarcoma-associated herpesvirus ORF 18" 

/translation="MFIGRGSVYGSRVATIEGSKYSSFSIFGRLTTSTYPPTYTGVML 
GRCLREPKEMSAGLRGLMWRVIRCENLNTFLPGELRFLHLVLCEMYNYGLNVYLLKEA 
lANTGTRDDIVLGRKVPVEFWKIAmJGLKEMGVSDATLLSETKRGALWLYFNGRPCLL 
KGLGDYVFCQLGLSHSVRWPENLTDGNYLYNLGSVIPCRLLVALSYCLAFWGHADHE 
PWVRLFAGKIFILYLIISGHIMPRKSILEQVGTSGYGGFVEAVCRDVRAVHGIPAWDF 

AS AAPALTSQQTDYLFAFNNS W " 



SEQ ID NO 31 
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f,no complement (29905. .31548) 

SEQ ID NO 32 CDS ^^^^^^ .. ^^^^^^^^ protein; ORF 19; similar to Kaposi's 

^sSSSSpfvnltgsrfneifdiujqkltfrdagsliqaqtslrltaeeglaailsh 

PSPPGLAHEIMKSQFGVYDDYDRVYFLVLGYLPVATSW " 

complement (31043. .32095) 
SEQ ID NO 34 CDS ^^^^^^ ..^^^^^ . qrf 20; similar to Kaposi's 

SEQ ID NO 35 /^^S^f '"rp?"^S^?FFEVSLGRRIADCVI.TWSEHQKTC^^ 

iSNJ?SSSQSHSTDSVAYIGRAAPRGHENWSVRPWI.I.FKNQKri.KTIHTESS 

S??f?JSsSSfsqwedahvrkmlveiptktsaanyrnf^^^ 

S^Srkkkrvcdakstagakgshaakkpapartrqraanaptgnrsgharprnnskh 

g^SS^JSScpStkpatqnrpadtwrrvrchnsprrpgihgkpgspsgapakp 

vhepkpmaatirawq " 

SEQ ID NO 36 CDS /^^^^^..^j^y^i°iJe ' kilaL ; ORF 21; similar to Kaposi's 



iSS^SS^TONyLRQVAWAyHAVFCTWVMMQYLTPEQMVQLCVQTVSIE™ 

JSSSSSSSheqsmipmvaemi.vsvkehvti.mevci.glfkelrklqi^ 

SDASSS5GQ?MSNEAIKPRAVNWPALESYIQTI.TKI.EGNGAy'' 

« rrx? 33754.. 35868 

SEQ ID NO 38 CDS ^^^^^^ ..g^y^^p^^^ein H; ORF 22; similar to Kaposi's 

sarcoma-associated herpes™ 



ElVYI^YMYTKySENYQLTNTFRRNVmNMyEVLTTI^ 

-^CFLSLRI 
ILAWKQV_ 

;;^flnsamfvsavypncshftppgtai.hipilynfsaprigcplcdsivi.sydenqgl 



nveisyminqiakpdditifrvfspcflslrfdldenklrsdapqtskrtgseu^^^^^ 

PaT.U\^^ 

IIHYLWLMNNGT\A^IRGLyRRHALSAIAL 



sgfwrllhafhatrinefsvinct: 



RLAWKQVTALMPLTNITYVISSVRPDHARVYEVS 



QTMMYVSNPTVQANLFSPYSPFFDNDNFHa 
VFAFIGTMSALYFLFKLFSILA" 
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SEQ ID NO 40 CDS complement (3586S .- 37073) 

/note= "unknown; ORF 23; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 23" 
q^ro TD NO 4X /translation="MIKISDLKARLVGGAVQLSNGEYVCHWySSALAAMVGLPGPAV 
SEQ XC «u X pLpLLFKKFSTIYSNMMPLYAPKRPELSMLRIMVSPHPYALNSCLCVGTDEGERGVSL 

FRDPVIRSSDFEDTPITVNSKLVIASNSLFLHCRPFSVPATVKTPPVTLTNNKQITIN 
EIoANTTQEYDPNAPPTLCSALPPDNKKLRSILKQPPATSESNVQSDCLLADIFFAMGS 
RQPQIGESPITAFNTVTIMQRANNSlMFIiPNLKIiKPIQHLFI.KHVLLQRLGLENILFH 
FKMLYANTCKAAGPYQREYFESMLSRVKQRLEDMVFCIjNS IESHDFQKDFRVLSRAPQ 
RLLTATDKYFLMFPPQNRELAIQVGAEVIESICDGTPLSEVLANLSPRVTIQKETGNN 

LLKFYAIiLTV" 

SEQ ID NO 42 CDS complement (37123 .. 39321) 

/note= "unknown; ORF 24; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 24" 
sso ID NO 43 /translation="MLLQGPVLLPACPATVAANAPSPANSDFKTQLAIFCCLATNNEI 
SEQ I« «u LENVSLEALDRAMQTETTFYACRALRRLVLGEGLYPFIHRQGGIVGKTGNEYAGPGLI 

IDDAIGCTFSHlETHTFLPTVFTYELSDTVIiVQSDERILRSLYCSPLMVCGVNYQSMF 
RILCRYLQIWEFEECFAAFTRTLPEHLIGTCYQNYFKLLEPFKILTLARCPPPCAKLH 
LNYL.KFNILGFTSDWISHPELHRVQTVIIHNIESNPVLLKNLSKQNKFQDIKVASELI 
IDYQNIVNQSLDVNLQVKINKKDPGKKPYKWWTPKSTYYLTFPPEVPIFRVAMCMS 
VAEHVCHSCDRLYPNTEFLGPGETPRVLEAMFSRIQYAPKDRDYNFIFNADQNPDRYE 
QARHDHQTEPIiPDMFDPVKHMSLHNFKI SVFNTNMVINTKITCWSLAGTFES I IDI PR 
LTNNFVMKKFSVKEPSFTVSVFYSDNLCNGAAINVNISGDMLHFMFAMGNLRCFLPVK 
HIFPVSIANWNSTLDLHGLENQYIVRRGRRDVFWTTNFPSWSSKDGCNVSWFKAATA 
TISKIYGRPLLKKLSDELNPILSVPYARIDQVKNTIFTTLETRNKAQIQTLHKRFIEC 
LVECCSFLRLDI.GAIJJRAARLGTFDFSKRIISHTKSKHECAILGYKKC3JLIPKIYVRS 
KKIRI.DEL.GRNANFMSFIATTGHAFSNLKPQVIRHTIRRLGLHWRHKAKI " 

SEO ID NO 44 CDS 39323.-43459 

^ /note="ma3or capsid protein; ORF 25; similar to Kaposi's 

'sarcoma-associated herpesvirus ORF 25" 
SEO ID NO 45 /translation="MEAALEVRPFPYMATEANLLRQMKESAASGLFKSFQLLLGKDAR 
SEQ ID NO 4S EGGVQFEGLLGVYTWVIQFVKFLETSLAVACVNTEFKDLKRMTDGKIQFKVSVPTIAY 

GDGRRPTKQKQYIIMKACNKHHIGAEIELSTDDIELLFIDRETPLDYTEYAGAVKTIT 
ASIiQFGVDALERGLVDTVLNVKLRSAPPMFILKTLSDPVYTERGLKKAVKSDMVSMFK 
SYLMDNSFFLDKSDIAVKGKQYVLSVLSDMVGAVCHETVFKGTNTYLSASGEPIAGVM 
ETTENVMRKLLNMLGQVDGGMSGPASYANYWRGENLVTAVTYGRVMRTFDQFMKRIV 
DRPNAQPSVDDDRDAVADGQDSLAKTPIAAAVIQIGDKLVALESLQRMYNETQFPFPL 
NRRMHYTYFFPIGLHMPRPQYSTSATIKGVEHPAEQSVETWIVNKNNVLLSFNYQNAL 
KSXCHPRMHNPMPCGQALGQAFPDPGHVHRYGQRSEHPPNMNLYGLVYNYYQGKNVAH 
VPDVALKATMTTDELLHPTSHETLRIiEVHPMFDFFVHQQPGAQAAYRATHRTMVGNIP 
QPLAPNEFQNSRGLQFDRAAAVAHVLDQSTMEI IQDTAFDTSYPLLCYVIECLIHGQE 
DKFLINSPLIALTIETYWNNAGKLAFINSFPMLRFICVHLGNGSISKDVYAHYRKVFG 
ELWLQQALSKIAGHEWGRRPASELINCLQDPNLI.PPFAYNDVFTNLLRQSSRHPMV 
LIGDEGYETENDRDTYINVRGKMEDLVGDMVNIYETRNNADHDGRHVLDVGPFNENEQ 
HMAVLEKLFYYWLPACTNGHVCGMGVDFDNVALALTYNGPVFADWNPDDEILDHLE 
NGTLREMLEASDIHPTVDMIRTLCTSFLTCPFVTQASRWTQRDPAQLLTTHDDGRYV 
SQTVLVNGFAAFAIADRSRDVAETMFYPVPFTKLYSDPLVAATLHPLVANYVTRLPAQ 
RVPVAFNVPPALMAEYEEWHKSPMLAYANTCPMTPTSLSTLASMHMKLSAPGFICHAK 
HKIHPGFAMTAVRTDEVLAENLLFSARASTSMFLGQPSVMRREVRADAVTFEVNHELA 
SLDMALGYSSTITPAHVAAITSDMGVHCQDMFLMFPGDSYQDRTLNDYVKQKAGCQRF 
GGPGQIREPVAYVAGVPHSDNIPGLSHGQLATCEIVLTPVTADVTYFQTPNSPRGRAS 
CVISCDAYNNESAERLLFDHSIPDSAYEYRTTVNPWASQQGSLGDVLYNSTSRQVAVP 
GMYSPCRQFFHKDAILRNNRGLNTLVTEYAARLTGTPATSATDLQYVWNGTDVFLEQ 
PCQFLQEAFPTLAASHRSLLDEYMSNKLTHAPVHMGHYMIEEVAPMKRLLKIGNKVAY 
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SEQ ID NO 46 CDS 43491.-44408 

/note="capsid protein; ORF 26; similar to Kaposi's 

sarcoma-associated herpesvirus ORF 26" 
SEQ ID NO 47 /translation="MALDKSIWSVTSRLFADEIANLQSKIGCIliPLRDAHRLQNIQA 

LGLGNLCSRDS AVDFI QAYHYLDKCTLAVLEEVG PNSLRLTRIDPMDNYQI KNAYQPA 
^ PHWDNYSELWIPPVFGRKDATVSLESNGFDVVFPAWPEPLAQTVLQKLLLYNIYYR 

VAETTPTDTOLAEVTLYTTNITYMGRNYALDVDPVGSSSAMRMLDDLSIYLC^ 

PRGC^LLTSLVRHNKHELVEIFEGVVPPEVQALDLNNVSVADDITRMGALITYLRSL 

SSI FNLGRRFHVYAFSSDTNTASCWCAYN " 

SEQ ID NO 48 CDS 44433,-45242 

/ not e= "unknown; ORF 27; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 27" 

SEQ ID NO 49 / translation = "MS IP KIMTVSRDNEGTVCEVAVDNGRHRAMIYYPKTTNIiANERA 

DWKEAFDTETPVDIVKQIVNEGIAlSKIO^CVRLALYLYFYIiQYVCFALLLTWQLNPY 
MDPPGLVFAWPMGPKHVTKLPHPAIVAVGCGADAICKNCSVPDIKTELGMVYHNGSS 
DSGQRAHYGIiALLKAAWLVMGNVCPEPVVRQGAALLGPWNRTEWSDFKSAMAATTFCG 
SRGVLWSPIHEKNLCRPTWNDVINTSVFTNESLCPNIPWPESVIVLNGDA" 

SEQ ID NO 50 CDS 45408.. 45683 

/not e= "unknown; ORF 28; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 28" 

SEQ ID NO 51 /translations "MTAHTNGVLTTTGFSTSQPESVQVSPFYRVITKPPVMGLFFCVA 

MCVIALVWYVMRRVCCKGRWADSCRDPRQPAYEMLNVRLRPHGTNP" 

SEQ ID NO 52 CDS complement (45733 46779) 

/not e= "unknown; ORF 29b; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 2 9b" 

SEQ ID NO 53 /trans lation= "MLQKDAKLIFISSSNSSDKSTSFLLNIiKDAHEKMLNWNYVCPD 

HKDDFNLQDTWACPCYRLHIPAYITIDETVRSTTNLFLEGAFSTELMGDAATSAQSM 
HKIVSDSSLSQLDLCRVKSTSQDIQGAMKPCLHVYIDPAYTNNTDASGTGIGAVIAVN 
HKVIKCILLGVEHFFLRDLTGTAAYQIASCAAAIilRAIVTIiHPQITHVNVAVEGNSSQ 
DAGVAIATVLNEICSVPLSFIiHHVDKNTLIRSPIYMLGPEKAKAFESFIYALNSGTFS 
ASQTWSHTIKLSFDPVAYLIDQIKAIRCIPL.KDGGHTYCAKQKTMSDDVIiVAAVMAH 

YMATNDKFVFKSIiE " 

SEQ ID NO 54 CDS 46905.. 47135 

/ no te= "unknown; ORF 30; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 30" 

SEQ ID NO 55 / trans 1 at ion= "MENDTPKDKISEADFQQCQAFFHRPIRDLISSGADALNHFSLSE 

SDGHKLERIVLLLDLVGTECLSYTTIAAKNVK" 

SEQ ID NO 56 CDS 47093.. 47746 

/note=" unknown; ORF 31; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 31" 

SEQ ID NO 57 / translation^ "MSLLYHDRCKECQMTRVNSPICRFHNVSNLYQCLDCKRYHVCDG 

GRNCVIVYTRENLVCDLTGNCVLDNVQDVCSYGPPERRVPDAFIDPLVSHGTRECLKS 
DILRYFETVGVKSEAYSTWKNGQLNGIIGRLIDATFNECLPVMSDGEGGRDLAASIY 
IHI II S I YSTKTVYDNLLFKCTRNKKYDH I VKT IRAQWMRMVSTGDPSRVS ATGCFT " 

SEQ ID NO 58 CDS 47683.. 49077 

/note= "unknown; ORF 32; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 32" 

SEQ ID NO 59 / trans la tion= "MDAHGLNRRSVAGQCDGLFHVILPRGFILANNITCGERQRFFAH 

TWFAASGRTSKTLYVWGRVFQNTDPGRGDGPSGPWSGLAISL.PLFTTNGKFHPFDWI 
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SEQ ID NO 62 
SEQ ID NO 63 



TVADTPDSGSSWTVKFLYMSLIAAYRNAMRGLKDKVSQCTDAAVDGEVHPLTVLKEAL 

SpsrSSSSJSqSypki^aqarhvfssrayyvcvydneei^tkwqrqdprp^^^ 
?wsS^yJSLlflgakqnafvdsiektcrcqnytikqffpvi,inrdnet^^ 

5FIEAJJJlRNQVSERSAV^KAAI.FRNDSirrYWKDVLGLWEHGPHKLGTAIKLPT^ 

SSS^wllcdeditrsisgqstvclwsptltawlvlpggfvikgrydi.ssedlm 
fvasryghpasshs " 

SEQ ID NO 60 CDS ^^^^^^ J^p^^sfsimilar to Kaposi's 

SEQ ID NO 61 ^p"sEN^SLSFIjaVMIVIMKPKEFGPCVTVym 

™gtlsnaprsvpipgplnphpretvpgltkqeiiytsqtvprgqipdaikgkefh 
SSp?SfSSSSwSfi.svdfmllcpaldtvpsi^ivglltqcdkstck^ 
2JSpSgy??pSqgtspscpcliscgarraadvlvtghvnllgl^^ 
SSpSvpLdamsgvtaegtevqptslpwalirlpdlasrvmlygcqnlksiclr 

SY" 

CDS complement (49977. .50960) 

/note= "unknown; ORF 29a; similar to Kaposi's 

pSg^iSwLssvlddylqtyrrvqepmpaptlgkprisshatlprlteelt^^^ 

Sc^SoMM^AKDQYMEYQSAQRTHEAFLECPVYAELRQFLANLSSFIJaGSYVPGV^^^ 

SptoqqJ^fyfiasikapekthqlfatfkqhfglfettddvi-qtfkqkasvwi 

iSjG?TO^v?AIISVLLSSVENVHVGYVAHQKHVANAVFSEVIATLSRWFPAKNLNI 

Sg?^I^pgrrpssi.mcatcfnknvsrcfi,ssgsriasrdwi.npage" 

„^ nr^c: 50959., 51942 

SEQ ID NO 64 CDS ^^^^^^„^^^^^^^ Oj,p 34,. similar to Kaposi's 

SEQ ID NO 65 ^sFJS;sJ^pESRPVRDCPQPEGFDFEHIHI.PKI,TRMQRVI.GRYCDHVNNra 

ISSsSSLfylpygqdewnwaltlrkdklvkmaveglsnpttwkglepvdplpliw 
?S??SSs^Speclyernfgmkgpii.lpphmyapqkdvmtfvhhvikyvkflyvn^ 
gg2ISpspSSsri.raaiari,gdveaddayi,sakcmlchlykqndtisihet^^ 

cSlSSDGARYITSSVRAQRCTSRGDFVLIPLYNIEGLVSMIREHGbGSS" 
SEQ ID NO 66 CDS ^^^^^^..^^j^^^Jf oRF^irsimilar to Kaposi's 

SEQ ID NO 67 ^ JJSJrS^SkKKDDLATVVGTLETSLREKQSEI.GLLKGFNRKKIEEFDAVADAVRD 

LKDELYGELEILGTLDNESVPVEEESPKDDIIRWKLERLPRVCPKSP" 

SEQ ID NO 68 CDS /^^^^^..j^. ^^^^f ^RF ' "fsimilar to Kaposi's 

SEQ ID NO 69 ^ JsSH^i^^iTOKTIAQLKIPNNWGQCSHQATDWTAVLGRGSYGVVRSMSLGRCVKHFG 

sKef^eJSSivracrekhplnrggdrilcflepcvpcralifpqltgnlln^ 
™pSSS?elregvsflnnicgivhcdispenil.ikgelttaygrlmigdlgs 
S?Spwtgvmvtsklgfvqhtyhfkaparfickhiyrpscllyrcllscaggp^ 
SmJ:SSo?SS^Stidisslgysliaclekylqpadpfpqqgaijidassesahplf 

edalenassrlcnsklklvlqkllvrdyfshcgncgdhgfflr" 
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SEO ID NO 70 CDS 53566.. 55008 

/note= "alkaline exonuclease; ORF 37; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 37" 

SEO ID NO 71 /translation="tTOFFSDEPMVQEMALLDIDEQQRLLSKMSLANFLKHERVRAFFS 
^ DNKKEISMPAIRFVYNFYLFAKVCDFIGNTDVYDFyVTCVFRGRRLTRLSEVYDACLN 

MHPHDRHHVCALI EQVTRGQNINPLWDALRDG 1 1 SS SKFHWAIKQQNSS KKI FNPWPI 
VNNHFVAGPLAFGLRCEEVVKKIIATLLHPGEAHCENYGFMQSPLNGVFGVSLDFGIN 
VRSDPKDGLEFHPDCKIYEIKCRFKYTFSKMECDPIYAAYAKLYQKPSMQTLKGFLYS 
ISKPAIEFVGEDRLPSESDYLVAYDKEWEVCPRKKRRLTAVHHLVKKCMIHNSTAPSD 
VYILSDPQETGGQINIKAHLSANLFINVRHPYYYQVLLQSLWQEYISLSKGTKNLGT 
QKNFIATGFFRKRQFQDPSCCTIGEFAPLDPHVEIPTLLIVTPVYFPSVAKHQLVKQA 
TEFWAASAREAFPELPWDLSSLCANAPPTP " 

SEQ ID NO 72 CDS 54963.. 55172 

/not e= "unknown; ORF 38; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 38" 

SEQ ID NO 73 / translation= " MGFILSVCKRPTNTVDVKGEPIDVSKEFDPIIGEESIVLLTADG 

TAPAAliYKPKTKPSKHKNNKLSDFV " 

SEQ ID NO 74 CDS complement (55255 563 91) 

/note= "glycoprotein M; ORF 39; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 39" 

<;eo id NO 75 /translation="MKISRSDSFILSSWVKLLVILGLMFIMSAWPLTATFPGLGFPC 

^ yfntlwysalnltvrssakhltptlfleapemfvyiswaflvdgyllcyyawailai 

FKAKRVHATTMTSLQTWIVLIGSHSWFMSILRLWTIQLFIHVLSYKHILLASFVYCI 
HFCLSFTHVQAMISCNSATWSLRVLEQQIPENSLLDTLLRYGKPIGANLYLSLIAMEM 
LVFSLGTMMAIGNSFYMLVSDIVFGSINLFFVLTIAWYINTELFLVKYLKHQIGFYVG 
VFVSYLILLLPWRYDKVFISASLHKVIAVNISMIPITCIIAIILRIIRNDWKWCAKS 
PEYAPLPQGPKEKTTKVKYSPELNALYETEEDVSDYEDAYPKYI" 

SEQ ID NO 76 CDS 56526.-57932 ^^^^^-;.c. 

/note="helicase/primase; ORF 40; similar to Kaposi s 
sarcoma-associated herpesvirus ORF 40" 

c;eo td no 77 /translation="MNAREVALTGHVLHISLHSTHEREKLIIWQVHLLVCQQCGIQGD 
^ AAYLFVTETLSNTDWGNI PAINRHAPS INEHGRNYMQWELRTRLRNPI IQLLSRQPGA 

VNVRVSEPNMVIVGCERALDHSCSVRVTGAYLHCDTTMDFSLDSWSPTREFWFSEMF 
SHCLVSNIEVYLKTTGGLYYRASSATQCRKRAKDGALGILDIFNCESREIQVAGQKYT 
LSIATATFHVLWVDEACMWNGALAEFFRALHNKLFGDREGVAPTLTYVCPGATPEGTP 
FPPYFSAFPHLPLVFGRPRRLDVTAVQELPKAQIAVHWPPFKDSILGDQLLIPGISPK 
KPGTVPVRWPLWVEDVNLSLCETTESVARIVDPHSIVIIKFSSLLCQHLKCHRAFVKN 
ELEYIATICSSDLRLFIQEEYNRLLATIFTWAAASGYTWAAIDKTTVFIKAPQLSAAV 

SGFCPSLNSCRRKQCYEG " 

SEQ ID NO 78 CDS 57917.. 58528 

/note="helicase/primase; ORF 41; similar to Kaposi's 

sarcoma-associated herpesvirus ORF 41" 
SEO ID NO 7 9 /translation="MLRRLKITVHFLSQEQQKWTRLEAHLGLPVQETSHPPDWLKCE 
" VCSASVFLKIPAGVLYAGLARDPTREAKRDSWLDCLVEGATLLLNNSVLPIGALAGIL 

PTLFANRRCVNFWLLPRAWVKSAPICPPLPIDCVTPPQFWTKRGPICWYKEWPbPVD 

VDFMYYLQEALCVFSWSNGEGTESHADNIRQLEKFEKVIiCIiF" 

SEQ ID NO 80 CDS complement (58525 5934 3 ) 

/note= "unknown; ORF 42; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 42" 
SEO ID NO 81 /translation="MDQILKRI.MGEQHRSEAVMPETECSSRGPYNYPVFPRLMLEVHK 
SEQ ID NO IJ-L KNSICMASNTPKLCVRGRLNVPDLGVHVRTRLQSATFTGFVFACWEHEDMIDALDIY 
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PHVFSDRVQLFKPASASVTELCCILSMLENYDKPPLSFILSALDRARYLHERYTCNDS 
AFVLYG I E V I ASTLAAYHELNPPQG I LRVPPLVRFKLHKLLDENADDMKGLLKP I YLE 
SFRLTENVGEEEGHAETFNI FYCGTIFTRHLHNASVLKYFQITSLHS IPRQTLF " 

SEQ ID NO 82 CDS complement (59297 .. 61027 ) 

/note="capsid protein; ORF 43; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 43" 

«!Tro TD NO 83 /translation="MFKMNPGFGSTCLVHPTELSISl.FEILQGKYAYVRGQTLHSSLR 
^ NPGIFGRQLFIHLYKTALGSCTYDNVLKDWTNFETTLKTRWRGVEHLTPEFKRSTFES 

WARTTOLTVDQLLIJJTINQVIiHTRTVLSYERYVDWVVALGl.VPIVRRTPDGDTIARIQ 
AHCQQMRKTYASGDVTISRIVDKLAQEITSIMTDVTSIYIPDYAEVSVEFNGDKAAYL 
GTYRQKDITVEWSRPIIYNGRVAFDSPIiYRLFTAIMTCHRTAEHAKLCQLLNTAPLK 
ALVGSTCNDMYKDILARLEQSSQKTDPKRELLNLLIKLAENKTVSGITDVVEDFVTDV 
SQNIVDKNKLFGTGTESTTQGLRKQVSNTVFKCLTNQINEQFDTISNLEKERDDYVKK 
IQCIETQLLQSLPEGGRPRHDINILTQNTLQALSGLRDPTIN1.SECHIPKGSSWNSF 
FSQYVPPFMEMLKELTSLWEGEMFQTYNliTPWDNQGQRTS lAYSQDTVS ILLGPFTY 
IIAKLTHMDLINHSLISLSLHDIADQLYVDSRLSVYINDIGHKYCEQISOPGTDGPNT 

EASNGGAAPl " 

SEO ID NO 84 CDS 60966.. 53338 

/note="helicase/primase; ORF 44; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 44" 

«5i.n TD NO 85 /translation="MESSVGWTKHVEPNPGFILNMTSDAKVRGVVDHVSRI.SNITTSP 
" PEMGWYDLAFDPAEDSGPFLPFTVYLITGTAGAGKSTSISALYQNLNCLITGATTIAA 

QNLSRRLKTFCPTIFSAFGFKSRHINIAVRKAHQTGAVSIEQIQQQELSKYWPVIVDI 
MKEVMAKKPNGMYGTISNANFETLSRMTGPCLWTSNIIVIDEAGTLSSYILTTWFFY 
WFLNSWLNTPLYRQGAVPCIVCVGSPTQTNAFQSTYNHGTQKTEISSCENILTFMIGK 
KWSEYVHLERNWALFINNKRCTDLQFGHLLKILEYNLPIPDEVMSYVDRFWPKSKI 
MDPLEYIGWTRLFLSHSEVKAYLTNLHTCLTLGGDTRDTKLFTCPWCEVFVKPFEEY 
KRAVNLTNLTVTEWVTKNLFKLSNYSQFVDQDMSIVATESTERSTQVTFITKFVKNSH 
VSLNGKTKKCICGFQGTYFEFKRILDSELFVETHSQDRPEYVYGFLNTLLYNAMYSFH 
AYGVTRSHEKYLQDLKFAPLPAALATGRVDLQTVREELNLEDDIFYHVCSPPPPAGIT 
SLQVLVDTYCALKDVFASRIKVACRWFGGEFEKETFSAFTVNMWRDGVDFVSPSERL 
NGLLAFASTVESYKIKGYTFLPVAFGRCQGLPLSDDLRKKMPSLWQDSSGFIACLEN 
NITKLTETMEDGSVFQVCCAGDYGVSSNLAMTIVKAQGMSLERVAWFGSHKNVQTSH 
VYVAISRAVNSNYL.VMDSNPLKTLLREPVDNTSAKHIVRAliHNPNTTLIY" 

SEQ ID NO 86 CDS complement (63379 .. 64437) 

/note= "unknown; ORF 45; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 45" 
SEO ID NO 87 /t:ranslation="MAMFLSDPSRTPPATPRMLPIPGAPRKKRTRRFLFAGSRTGLPV 
SEg ID NO B/ ppGYGGppviDMTAPEDVFDQDSPPTTPKTPDETDSHSENSDYSDVDEEDEPPVSSPP 

RIDPHARDGESFNQSGRLPTVITSTGATTPPSAPAPLTAFGGPRPVAWTGQHRAPQS 
SESDSEDDFFIDDYEDTDESGGEADGFSPRASPAWSGDISRSPAEGGWSSDEEEPWA 
GSTAGQETIIISDDDEVDDRGSVETWDESDADEGTGATDVIDLCSSSDSDDDADHVTS 
GGVRAACKRRASRRDCNGDDDVIYVGTTQGPKRRMTSTTGGGATSNPEGPGVSGRQTM 

AATPPVCGNDNYPWPWIiD " 

<!EO TO NO 88 CDS complement (64479 . .65246) 

" /note="uracil DNA glucosidase; ORF 46; similar to Kaposi's 

sarcoma-associated herpesvirus ORF 46" 
«;eo id no 89 /translation="MEGWLKTIVWSKMSPEVLEEPSTQTLLLSDSWLEFLNLSPFLKQ 
SEQ ID no ^AALLKRVMDMSNVTVIYPPIDRIMWWSYCCEPEDIKWILGQDPYHRGQATGLAFS 

VAPDYSIPPSLKNIFKEIANTVPGFTAPSHGCLDCWAKRGVLLLNTILTVERGKAGSH 

ANLGWDWFTSYIISCLSAKLQRCVFMLWGRKAIDKATLINGQRHLVLKARHPSPLATA 

HAATGS PWPQFIiGCNHFKLANDYLVQNRRG AVDWNI N " 
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SEQ ID NO 93 



SEO ID NO 90 CDS complement (65222. .65731) 

SEQ ID NO 90 /note= -glycoprotein L; ORF 47; similar to Kaposi's 

sarcoma-associated herpesvirus ORF 47" 
«o TT, >,o 91 /translation="MRSMyTLSLFITCGFFLITCCTGLWNPCCKIIPLSDFIFPEPF 
SEQ ID NO 91 E^SsFHLTNLALCPGLCTATLRYKADRSTTElCVNGFHLRAFFIRILYKLNYSVPREE 

LQLLNYMQYSLDEFLAEFEDFHINGSESGTAYTRPPLLDFSDRSTKVSRIRKVITRRG 

DLWRVGLKQ" 

SEQ ID NO 92 CDS complement (65999 .. 67168) 

/note="unknovm; ORF 48; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 48" 
/ trans 1 at ion= " MAVSIPVKGVNRETESNWRS I VTTFERHGNADRAIRSLLRFFKG 
VDHPGFLASLVILKDVTIDSEKTIERTDLIPLLQGVRFVTQQIYMHLKDHASESPMAE 
IWRDCKERFCLA1.ELACGCQRCASAARQLRACQQACRPPKLNPHKQQCVAARLLTAVY 
NOMVLRTRVSVSEFCLNALMCVPREFGFVSGDVRVETSRVASCLNLSWLYLILDSYVR 
TDLTNLEMAMSRACRIHGLSAGDPFYSALVWLKNSYACDTNTFFFTVNSTSVTTPILM 
DICASLTGPVPDVIKINMLPLVNDQMHPSVCVERANFTGSCPKVSPTHHLDGLKLETT 
SLTLAADSLDDILQALELICGDDEGILDSYISDINTETEVDESSIEEEIVFEELS " 

SEQ ID NO 94 CDS complement (67398 .. 683 03 ) 

/note="un)cnovm; ORF 49; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 49" 

c^o TT. KO 95 /translation="MSRHYGKDHLLNHMyKFHYPPLGMIVGEMNTLTVNARNPLYQAA 

SEQ ID NO 95 ;lr^ralylskilqvlmqhrqgerfivpqcrsnmvyci.kelhkitndrirgi,insvl 

PLVDAGCVGFDEELVRVLPEILKLEYPHAHELLPPHDPTSPLSWCLSHMVGVTKTFKG 

evkemidtfhdlsvpsfqylaslvkkfflveeviyedyqdtqfnvflnlcffwttvik 

MYQSCIFKDKLLDTIKACIELLKGEARQFFGWyDLNTPNLGSSALVKYTEHLIRALSV 
DSSAIPIGEICSHLHHCKHALLNLE" 

ewn TT> vn 9fi CDS 68494.. 70038 

SEQ ID NO /note=" transact ivator; ORF 50; similar to Kaposi's 

sarcoma-associated herpesvirus ORF 50" 

^^n -rn no 97 /translation="MECASLGPISGLIADLNLFNLFCLYRGSRVKTRGAATCNVPCAE 
SEQ ID NO 97 caSgIVRILTERALCCTEKMFIASACSGWIPPQLARVLHDVYAEMKAKCLGAWRRLI, 

CCRRPIMAIADSVLVTYNTLDAEGKLELRLKALCKLVFQPIFLQRILAPMQLLANGKM 
VPDNYFTITGTAEKRRPWTGSTSGMTCPGSSLVPDSLILPVCEPGLLPAPLVDLSNV 
LENPEI ILSAPPLSQFVITNTHPSLPQSVS I ITPTQGWPGQCFMDTWKAVSQSIHHQ 
AOTPILAAALTGSTSAAPGPHIACSPVAGTSRQVEGSAGVDCGKPACVPQPALPPNVP 

akrmetvaqlgnapvknvhiggrvyaplvnipiidltspsgsgqspadiantpesrma 
agsppfaetaatvpakrkqpredvadkrlkgdvrgaatvnhpfpgpsgmrvreqglfd 
liesstdvtanasgpkndddmlaailqdlyglqsppaidspssnsdneeifpevspps 

SGHGSP" 



CDS 70355.. 70888 

/note~*'R4 " 

/translation="MPRVKTQPKRPQVLEFMPLDLHGGTHTEMDSQNLCPDGQDLLGS 
YTYTENNGPFSQIMHNGQSNTGTGESFGSYAAGDGFLGGSVSGMYGNNTGEGACSKRP 
SACRKRSAALIHAASEASVAEQGTSQGAHAVSDRIGRDGGADNRLLKVSARLSDKTKS 

ALRSHPCIiRCYSLMFNT " 
CDS 71468. .72160 

/note= "^RS " 

/translation="MGFGNIRLGWRLCFMVWVAXIARGRSVCPTWHLTDGKyEAVYRH 

SEQ ID NO 101 /J^S^JS^OSGSLDGSGQTKGSGTKATTEA^ 

HDLPRIKQVNALRLSTPELAQPLPWKSTPRESQSGGTPWNARPHAFIMHTNDMLNPS 



SEQ ID NO 98 
SEQ ID NO 99 

SEQ ID NO 100 
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VVLSFRAIRARSTRDTEQSVRDRNTVTTSYRTPGRPSLFQARPSSHGARLPPSPRTMA 
RYAESRTICDQN" 

SEQ ID NO 102 CDS complement (724 01 .. 72 820) 

/ not e= "unknown; ORF 52; similar to Kaposi's 

sarcoma-associated herpesvirus ORF 52" 
SEQ ID NO 103 /translation="MSSTRPKTRAPKKELTMEELAAQVQKLSVENKQLKKLINSGDPT 

RSGSDPVI SNTEKEAKIAAAVSALCNVATRKIEAKVRAATAKAVTRGQMEDALAGIS I 

RVDVSMDETTRGGIAASADGALRRRRAQSRTRNWDAD " 

SEQ ID NO 104 CDS complement (728 84 73 198 ) 

/not e= "unknown; ORF 53; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 53" 
SEQ ID NO 105 /translation="MTGSIVLALALlACLYLCLPVCATVTTSSTTGTGTPPVTTNPSA 

AP S VTPS FYD YDCS ADT YQP VLS S FS S I WAV INS VLVAVATFL YLT YMCFFKFVETVA 
HE" 

SEQ ID NO 106 CDS 73274., 74146 

/note="dUTPase; ORF 54; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 54" 

SEQ ID NO 107 /translation="MAEVTAHTVPYAFDSCKFEIIPKNNSSRIALRNKFPVVVKPGEP 

LWPLGLKIIRAPQCAFFLSGAPTDEVYYHTGLIDQGYRGEIKLIVLNKTKQWTLYR 
GEVNVSLIAFMYASPGPLKCPILNLPHYSLDAGFDVTSPHAMTIPPTDRTPFTLSLYY 
KSPQLSTPHVPLIVGRSGLATKGLTVDATKWTQSIiVHIiRFYNFTKEPIDIPANSRICQ 
WFIHEDHVPSGWNILRSRVQLGSTLQISWAKIRFTDVATLPKTHPLNSRHTQSQTEP 
ETARGAKGLGSSGL " 



□ 

SEQ ID NO 108 CDS complement (742 07 74 83 9 ) 

/not e= "unknown; ORF 55; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 55" 

SEQ ID NO 109 /translation="MAAPGSFWTCCGFSPFGRVGCQYRPLPDPliNECPTHWRTEIAMG 

LPPGVDMGDVKQAEMCTAALRQTYLLAVQSNKITEYLRRFDAARVPAGCQETVRIQIS 
KLKSIQNVIWNAMLSIAIGDITVDESAFHALLNKRADETVSLLEMEKLATTIASDDSV 
TWAAEINNVLVDTEASSNPSHPVIRQPTPQIAVADNIVPDKIIQDAQADG " 

SEQ ID NO 110 CDS 74851.. 77337 

/note="DNA replication protein; ORF 56; similar to 
Kaposi's sarcoma-associated herpesvirus ORF 56" 

SEQ ID NO 111 /translation="MVDEIRAIFSTSGDMAEVITDILTETQATASFFCVLHDRGDAPI 

NTPHAVIKLCLPAKRPGGGPRCLPLMVXNLPAWQVNLFLTGDAPLTSDNIKDRIDIiAQ 
TEEILEPILSVLACKRSAQQTKHDSFKSKVAWFRAKFVSALRKVYKMTPSPYWMITLL 
GSFEASFVLAGTFYFFQSHICTAETLVHLTRLFSSSQGQSLVTVNTYDELGRVFGRSD 
FLGIVPNFWAYLKYKMQQDDVESRAIDQTINSIRGGLMLSPQDLVHFIYLSFYECMNA 
QTFLSYSRTTSSLPTPATVNPPQLCRRLEADFKEHVMAYYNKASYLSTYITILTVPAP 
LPDGYENFQELACQYWCGQSRDVAEIMTRINDQYPQLNIiTKDLSGLLDlAALDQYSGG 
PKENLFTVASRIPTYRCEFLNKQYFVLMHADCIDAYWKQNIIVPEDAQLQGLTDQDLT 
SRI F YCDLGIiS LPTFKQQI LVSRHE YFNPRL P VYRWVLD FDLKVTEGRRTLND I YN I C 
VTLRQVILETLQLIGPIiKPNHPVYFFKSACPAVTWPDDISDTAFCHCDAKIGMRIVTP 
FPSGYCLVGSAPLVSLTDILNRWKLDTRLASEYPGILEDKGPFDSGIYAKGRCVRVP 
HC YKVGPGGELSRLLKI 1 1 CHPEESDKS AYLKNAFKVSNLLHHAPGDS VTKNGHLVYA 
ITDENEGFLESKTKNNLPKTITDLAEKIERTTEKPLIDWAATAVWPKLHDTIQRFFPD 
DRIGQFASVSFMHSGDNIIQVKPQKGNNFFCINHKHRNHTQTVRVFL.TLHSTKESEVT 
VTFMSQCFAAKCNHNSPTAHFSFMVPITGT " 

SEQ ID NO 112 CDS 77578.. 78906 

/note=" immediate-early gene product; ORF 57; similar to 
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Kaposi's sarcoma-associated herpesvirus ORF 57" 
SEO ID NO 113 /translation="MRYVFHALICFIGGISSSDFDDSSSDEMDDLSPTPEPEPSTTPN 

SFPEGPKSQWALPKIRKRSRSETPVKIEHRSPLNRSRSRSRTRSGSGQRSNQSGRYV 
KRFKPTVDAPRHREPWHRGGKGKAPFIRRDAMAGRGRRTYGHDYRGKAALTRSIKESI 
KKMHLPSTMLSRAHDKKVFEGLLPRHLGQCFQVCLPAPPPLQPEVFTDRQLTAIVKSG 
GRRDAIiVAKKVSIAKLTSLYKPLLTFVTGRlWQAHWIJ^TRKITTIASAGLE 
GLAWAQVCVSQNRSLNDSNLDIILDSSQSVCTWFISKIRHLHIQCFLENQGEVSLVKQ 
LTYLVCINNRLAEAANIAGEVKLNFKLGMLIGFALTLPALI^HKLSGESLYIiFRSFL 
EKYRPGDVMGLLNS I WEHYTKCRSAECVITTHAMVGSGENNKGLFFFPV " 

SEQ ID NO 114 CDS complement (79266 8 0513 ) 

/note="R6; similar to Kaposi's sarcoma- associated 
herpesvirus vIRF K9" 

SEO ID NO 115 /t:ranslation="MATWRPPQSGGPSAMGLREWIVTHANLATYSGLFWADDEKTRW 

LATTTPWSVGFDYLRDGKMYEDYCNQRNIPLPSGRSRLGQAKARIiLGAIRKSAYFIEE 
KDVLRRSFSFT^NWFRLRSDEEMLCRLCPRASGVAAELRGLRFRMFKRKGADEAGRVS 
EYTVKQLLGLLRTRPAGTFTMTAPATEASATATASGEDGRQDNSQGGAVALPGEHAIiP 
LSASSGLSACLAPSVDDPWGFMHIQVYYYGVLQAQTFTHSGMGVRLSTRPTDKNEHHV 
CMAPGPLQLWLPPAPYMDDDFMLSRLWALHALEDGIVLCSCQYGIMMNGYGFLNL 
RGNTSNTSEPRRVPSGVGHRVFDTDEYMLKLAQSPRPSDPGPPDPFAQIWVSAWSLYE 

EEDQSQAPICIWHQREIYRHFE" 

SEQ ID NO 116 CDS complement ( 80686 .. 81933 ) 

/note="R7; similar to Kaposi's sarcoma-associated 
herpesvirus vIRF K9" 

SEO ID NO 117 /translation=»MAGRGVDIRAWLVAAVESGEYRGLWENEDKTVVRVPWNKVTAD 

RSVWNSEKFFDDYCNMRGICQGEKPSHYGRFRKMRFLYDMRHHKSIRELKFINKAYGR 
PGARYRLFRL.LPEPWSCAMCNLMSSTETQCLGLISEFQYDQGGGSGRERRRVFTATV 
LARSRMDKNKRVREHRLPGAIQLTFLYFGETVGLERVHAGIRVCSRPCPVLAGHACCF 
QDERTLFLPSPGWDCQFAREDLRVMHKKCEKGVIilTLTDTGICVKNLENREMKVLTN 
NEEEYKDLPSRQPVQVFDMVDYLRALARSPNPGDEPPRDYAQlAliCLSVQSPNPVDAP 
lAMRLRYVCETSSVCGTEGCFYPGTTVTSEGRTDCSFQMEDPGEGTSQSHDPAVELGD 

' SGPDSMDDPDAGTSGEDDGVACS " 

SEQ ID NO 118 CDS complement ( 82262 833 17 ) 

/note="R8; similar to Kaposi »s sarcoma-associated 
herpesvirus vIRF K9" 

SEO ID NO 119 /t:ranslation="MERPVRVTKPSSLRGWLVECCETGRHPGMRWVDEERTLIRIPWN 

HDRGSRGVEEAEKNIFIDYCRSRGILHAAGRELTAKECKNWLSSAIRHSQTVSDVSTK 
DNLSTPYPDRCRIIRLLPITVRSCARCDQASGTTAMLRGLREEAVNKFGPVGAGVRYT 
GAVGAGGEQCWMLRIMFYYYGDRVGEWTESSNGIRVLPLSERRPQGHICAAPIAEQA 
LVPEIPGHLAEFQAEALRFLDKDLLRGLAFWADPSGIYIRWLGHSLAFVQGNVESTGA 
VAVLSCANACRAFNLVDYMTAMARTS PDGAALPQACVYLYFGGVPTPEGGVQSTVPL I 
I QLWHECIiWRALS AANV " 

SEQ ID NO 120 CDS complement ( 8 34 91 84 252 ) 

/note^"R9; vIRF" 

SEQ ID NO 121 /translation="MDSGCYACILDENSEGIINYLEQVCGIGLEPGMPLPAPLPTLVP 

PTRSAYARAHRLGVPEAPLPHQIVPFWRLRIQVFYFGVLALDHTSQDRRGVRLHPRPV 
PHPGHLCFYGTGFTVWFPSPDREKLTAEQITQIKTMLVAYNEGIYVHGNETGVYVDNR 
NRETLYAAGNDCNGDXIQREVMFLSKQKIFNFMGFMRKLARSPGPESHAPCNGATLYL 
S QQPG AQE S PQVP I S VWCQDELVQGHMNPS RWCS " 

SEQ ID NO 12 2 CDS complement { 85 052 .. 862 09 ) 

/note="R10; similar to Kaposi's sarcoma-associated 
herpesvirus viRF K9" 

SEQ ID NO 123 /translation^ "MAAGESRRGPSRYGMALKEWLTFKADSGLYPGLFWADEQKTRLV 
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LAATPPSFPNYDYQRDGQHyDAYCELRHIPL.PSGRSRLCQARGRLLGAVRKSKYFEED 
KEFPTDQFPFTALVFRLRSSEEMSCPVCPRVCALRLELRNMRFAMLGRGMLHALSGPS 
VSDQERRYREGHQDGHDAQDDDAAYSSGLLRARLMACAAPSAGDPWGHMHIKIYYYGQ 
LQAELLTATGQGIRLSSKPTNKAGHHVCVLDGPLQAWFPPIPQTTESSWQRLEDALK 
WL\7DGIIFCSTSRGIMFTITGGPNVWFQGNTVEPYSL.PHRAYTGMHVWAFDTDRYLLD 
MARSPSPRDTGPPAAFVKLWVSGCSLGEERNSSRAPLSIIVYQTEIYRHFE" 

SEQIDN0 124-^ CDS complement ( 86355 87527 ) 

/note="Rll; similar to Kaposi's sarcoma-associated 
herpesvirus vIRF K9" 

SEO ID NO 125 /translation="MAERDMDLKAWFIEAVESKRYPGVEWDDEDKTIlRVPWNRCTDS 
" RVDEDYNKIFDDFCSARGVCQTGSHAQKFKKIRMLYAVRSHRYLRELTPPSKAGGVSG 

ERYRLFQLLPEVTVGCDLCNLIATTSLHSCSMGSCVREDVFERTRRPRAKAPLRVSVY 
KRKSKRLQDSSAQPVLGAVEVSFFYFGENVGVQILRAGSGVRICGIiPDPKRPGHLCCA 
DNPLTCFLPSSQLIPCEFARAr)LQALQKTCERGLICVMTESGICVKNI.EERNMTAI.TN 
YSENYYELRPSQPLQAFDLLHYLRELARSPTPGDVPPRDCAWIFMCPSTQSENTWDAP 
lALKLRYVCNDDVSDDVSNGAAGDDSGDEGPSGAGVGASGTTGSTSVSTLAPYGRK" 

SEQ ID NO 126 CDS complement ( 87 894 .. 88961 ) 

/note="R12; vIRF" 

SEO ID NO 127 /translation="MAEGRAGSIRVNRPSGLRAWLLDCCDNDKHPGMHWLDEEKTLVR 
" LpWNHLKGAGGVSDDERNMYLDYCQFKGIRQTGNRRLSVRECKNWLASAIRHSQTVED 

VSTEENLSAPAPNRCRVIRLLPIFVRSCPLCNEADATGGMLLDVRNEVTARFRYLGAG 
MEYEGAVGGDGEQCWMLRIiWYYYGRLVGNMEVGSPNGVRLLPAPKRPLQGHVCAGIR 
PEQALLPHTPQDMFPHQTSMLKWLGKEIIRGLMIYADGSGIYIRYMGHVPAFLLGNGG 
SLEPVDIINNARVLRVFSLAQYLSAVSATPPHGTRFPAAYASLHLGGVPTPEGEPCPT 

I PLS I QI WHECLWRACGDAAQ " 

SEQ ID NO 128 CDS complement ( 89122 .. 902 16 ) 

/note="R13; vIRF" 

SEO TD NO 129 /translation="MTEIEITHNHLRRWIISNLEANTFPEHLCWCDEEKRSFRlSWHR 
" GMSGMQPWAYCLDRDLECGRQHNVSECRKRLLRVLRENAGFEQDDARATTTRFGGER 

FFYLRPAVDPLCYACILDSHSETVLNYLEAACVHGLEPGTPLPPPAPAEADGAARSVY 
ARAARIiATVAPPHPDQITPFWRLRIRVFYFGSLVAEHTSQDRRGVRl,HKRQDPKPGHE 
CFYGTAYKMWLPKPQLDGPLTPEQRETVCEIINGCEEGVFLHGNELGMYVDNRTRHTV 
RCAGNDAEGNHAQRAVRSSVKSQIFYVMGLLRRLARSPVPGDTVPSNAVTLYLGGRPG 
SSKRPQVPVTLVICQDELTHGDIRAARWIL" 

SEQ ID NO 130 CDS complement ( 90462 .. 91544 ) 

/note= "unknown; ORF 58; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 58" 
SEO ID NO 131 /translation="MGTYTSEASLAWLSFMSGTVSASPFILCFIYHSLYFVEPLISVE 
SEQ ID NO 131 ^jipsWGAVGLHGLLLLFCIFGLPAWLSRQVDVPCTISAFLITAGSMASTLGVDLPWV 

HVSIFVGSCLCLLLCWAANDWYLCPTIAHRYYELGFIAAFSVYYFLVLKNLFLAPV 
FLLPLVAFIVGGVCSLRALRSHPLYEAGLQRRHAIFSLTSRRYITYSIKQALEVCGWD 
FYIiVTVLIGGAAAGTLSVGLTTPLLLGLVHYFFVFHVGLFCCLGLVFRSNVLALVYVL 
AAAVLLTLTHVLGPGTHNLFTRVCVFTVFLLTMFGAIGCELQI IRKKLQRAANS PRI V 
LGVCACGNLL.MAWFFSL.NKVELGAL " 

SEQ ID NO 132 CDS complement ( 91555 .. 9273 9) 

/note="DNA replication protein; ORF 59; similar to 
Kaposi's sarcoma-associated herpesvirus ORF 59" 
SEO ID NO 133 /translation="MPVSFHYGARVDVDALGSISRVYDHIKGIVKKGVlQISGQGRAP 

VLSVLSSVGDAGVLGLRLKNALAPLMVYSDMTDEVSFSFRNTSLGNTFTHTREMFGVN 
lAEMNVAFYHHGDESDAEGKPQFVRTTIAYGDNHTSTVHKSWDEPNLPSFHDRLEQA 
GTGNRLFLTVKTLTLLLKWLRQQKTRAKQWTVSLSETLAVATFTVDGVSKIIDFKPD 
TPDAKWTCARGRKLDVGWSSDLTTHVSLESLVAALNACKIPGFFLPGFRWHANEILE 
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VEGLPLTDSLADVRLGVMLLKVDPTDRNNAVPGNLSEGADPEGVPELPSPPRTPDLDL 
KEQCVPIAEDGAEPTDGGAKSLRTSGSRPEKIOiGKRKHSSSPSRGKGKTKTPRATFNP 

LP" 

IS complement (92 868 . .93812) 

/note=" small ribonucleotide reductase; ORF 60; similar to 
Kaposi's sarcoma-associated herpesvirus ORF 60" 
/ 1 ran s 1 a t i on= " MFGLS I VTAAMES PDRFLYASDHPGFLALTQETWQNRWFPSQI S 
LHEDSDEVRLLSPTDREFYQFLFTFLGMAESLVNFNIEDLVKEFSNHDVTHYYAEQVA 
MENIHGK^ANILNLFFGGNRGDLMIYAKKIVEDATIJUaCIDWIJiSRVRKATTRAEKV 
LLFLVIEGIYFISSFYSIGLFRIrRGIMRGVCLANDYISRDELLHTRAASLLYNTMISR 
DESPSVAYIHGLFREAVEIETLFIRSKSRDVTMVWVGDIEQFLQATADRILKSINIPP 
LFGARPPNACPLSYTSAKSVNFFERDNSEYVTSVHNDL " 

IS complement (93794 . . 96160) 

/note=" large ribonucleotide reductase; ORF 61; similar to 
Kaposi's sarcoma-associated herpesvirus ORF 61" 

/translation="MNTETSFSAAKSAKPLTLVTDAGTGGCSSSLDPERCAESLVNSL 
KATLGWDIEANSLTGLLWHRIMEDRCLVTVRDYLAVFGERLSDEVRAFMSKHEAALDG 
LLQDFKQSKAYTNFVNCG YLSAVRFYDTYVliRTQG S S P I FES VAQMFMRVAVFVACQC 
IKFPCLRETLRHI.VESETELDEMYLVGYAFHYISSQIVCCATPVIiRSAGIiRGGQL.SSC 
FILKPSMATEDKTLKALHEEMSPLLASKSGVGIDVSSFAEHKNITSCLKLINAHVGYF 
NDNNIRPVGASAYMELWHHQICDFLNAKMPENQERCHNLFQGVCVPELFFRLYETNPD 
GQWHLFAPEVAPNLLKLYGAEFEIEYNRLVAAGKHSSSLPLKSMMYALINTVIKTGSP 
YVLLKEALNKHHWCETQGSAINCSNLCAEIVQQPEGQASVCNLANISLPKCLRPHRGE 
SGVEPGKGDVTFGFELLDDAVEAAVIIVNACILGGTAPTESVRRGQEERSMGIGVQGL 
ADVFAELGFGYLDAESAKLDVEIFQAMYFTAVQTSHEIVLLGEGTPFRGWERSRLAQG 
VFHWQTWDGVKPSHPPLERWEQLGRSIAQHGIFNSQFLALMPTAGTSQLTGYTEAFYP 
FFANIASKVTSKEEILKPNVTFFKRVKPGDLRTVRRYGGDVASFPEPLKDRYKIFLTA 
FDYCPIKQLERAGARAPFVDQSQSLNFFLKEEQATRASYIRDLLLTGYRLGLKTMLYY 
CRIQKQTKLNALQCLDQWGDNI SSEGAESNCVQKADGERTKVCLACQ " 

IS complement (96163 . .97158) 

/note="assembly/DNA maturation protein; ORF 62; similar to 
Kaposi's sarcoma-associated herpesvirus ORF 62" 

/trans 1 a t ion= " mktrdanvnklndslmrllpppphrvslsrgrdfskgvrdlls K 
ywstttgveaikdgflsvsptcqtygdfliysqtmssqeprgtylfsfkqtdtgssi 
dmlftptslarlsrmdadsapqtnriacvwghesglldaipnfeelletgslhqfla 
pvgplvqtvhstfvtkvtsalkgnwarepwthigltlpsdmfvdlddscpsslrde 
plpahssiyvcltyirvnnrpalglgffksgkgyceiaaqlrdfysgvirtkyiqlqn 
dlyinrlafgwcrlgsvpsglqpsfqslhfkgaalpvlkftefvsnpgswklfl" 

S 97157. .99976 

/note=" tegument protein; ORF 63; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 63" 

/trans lation= " MASS IPAARADNGDENTGGLYKLTDNLLTCTGSLQQLKLLMEFQ 
LKPLPTAHLLSMPTVTRFLNTAFKIDNPLVSFIQKHPVFFLMRVARLPEPVITDHQSA 
ETSTGILSEWNVLNTAIRKPHESPAAKDNDYLDNRAILAMITEYIHHVTSRTPSGIP 
PTPPMGISHLPCVEQILHETHRQYWNLTLPESLFIDIGEVASPLQTWLILSYCKKLQL 
APPPLFPPVDELARRLVTGHHELFVPLSTSLETYITMPVSKQRAFEIYSVFAKSKNIV 
DGTPILAFTDTELTTFTPELLFLYDFVIESLCKNQAYGCSRNAIEHFIKKGIDFMAEL 
GAFIEKTCGYRSTVSLSNVRAVKARLASCGLSKEACEDFRAMILMTPHETTPKWENFT 
DFLEMVNQLTLYGFYFYECLNQYSPTSISLAKIQNILNRVDAEQSDRALWRTPLIGSF 
PFPWKLNNVLAFFKPSTPVATLQKIYKAIPSYLMRSLFEIAANKSWGNIALAESAPLT 
DIQTAEPDQGPVSAQVIAKYCSRLQISATDYDAAIVSSPGFAAEFIKTKLYPILSEVL 
RNTSKKNRSLFQIRWLIVFAAEDARDLAPIRRSLALAYFQIMDILEEKHSPESFYNLL 
DYLQETFRCIRQVIPEATCPQEFLQYLFTFQNIPIAASFIQTSMTFVDDLKNGIPGIL 
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DLVSLGAAFYNMKLLYDSTIJDTVEIPTEEGQPIWSMFVFKSTIRVLEKLLQEAVIAL 
TQTSEPMYAAHIRLMQHLTYMQKIAGHEIMTTQLPSVFHEIHEGYLQCFKRFKRLMLH 
VTGSCCYSLTRYFGFLYQPPLIPDTIVQKILNFNDKTDTTDDILKSLSQPVRQGPLSA 
ENES S SRLS KNNVELLQKLYDDFRTASTNNNPTS I KLE YSGNYNETQVS VDWST YNLV 
TYTAPDDTLKFTPVNTEALDRMFAE " 

CDS 99980. .107626 

/note=" tegument protein; ORF 64; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 64" 
/translation="^5ELPPIFSKFKIEGVATTHQADCRFGQYAGSQCLSNCVIYLAQS 

YFNRESPVTDTNDLDDVLRQGATLDFILRRSGTLGYNQYAQIJIHIPSFIKTNEWTAAI 

FQSQEYFGLIGLDAAIREPFIESLKSILTRNYAGTVQYFLFICGDKAGAVIIKNKTFY 

LFDPHCVPHVPNSPAHVISSSDPTAILEYVSPPDREYTGSFLYIMPSEYVNPEHYITN 

HYRTITFAKVHGPHIDISTGIEPCTIEDIPSPPRSPDVTSKSSNIiARVPRTTTDTSSA 

KPPPATLSGLRGAEPPTSYPDPATNDADTKLIiTPAPAQTAVDHPEFQTTPGATLLLSE 

LSASRGRKRKLSSLQRYSDSDEASSDDEGAPRRRVHDDAISAEVIWMDDDISPLYSPS 

ATPSFDDVFDSPPMSPEFTYEDATEDTDGAFLEQIARDAETPFSAFDDLITDHDFSSI. 

DKKIEQLI KYEAPSQHLPNISDKQNGRAVREAAALQAMDKIMINI ILEHGLITDAQAR 

GPSACKNVLQFFILWGEKLNIPISDAKQVLELDIiQLIPLHTAISEGKFKQGAFKKHLT 

TKINRCIASMRATHADAQKKLASAFNVEGSQISSSEAKISVRALKEQIANHLSPGFLA 

VYSADEVKHLRDKIQDLKTGIEQRNKEIQQEELFFDAMLTALDTFQPPPKTAFPMEIF 

PHRKTEVMLDHLAS ITTRLTED ATEALNNYLETPPDQGTHITNI PNFSS I VAN 1 1 STL 

KILTYAENDMQLNVTPMATYRRQLLYLGGELATIFNLEWPYETVPPVQELPLVARAKA 

KMESVTKMEKNQQAIJ)QILGDAETLLDTITATSGDENPVRAMSIPILETYITNAGALI 

GSSRNQRFEKLKAAIHDIASSESFIIMLLNirrRLDNISDNLAKIDGILTNNTRFLSNA 

TVSKTLQTLGGSLIRECVEALNKRSPSSLNNARLLAVQTILGHASVPDHETLTRIVSG 

VASAQKESAGDDPDRVmiVTGHI^ELKLVTTQSRVDKATRRiabI.MIITRDLKEAEVSQ 

ETVLETRWQENVLKFQPSTSKEIEDFLQSAPSAKARKFAEKHLRTLITQFNGHERPPS 

EATAVPMDYTPTPIPTPQAVSTATAEKGKAAWNKIQQAFQDFNFHLIDASDWQEI^IASE 

YSRHGSSLPGTVGPKLVRFMESISNTLDDILTQKIiASLLPNGPAFRPPAFDWIAPYQT 

RWAFLKTIGLPMVRNIJUDKIHHQCQTVSHAVQSADLQQATVGTSLERPAAEYCRIL 

DMQVAFNDHGIAVRSEAAAYTDAINSPANWTPPKPNLEAPKKLITATDALTVEDFPD 

FLKTS ILQQEQRLIALQRAEFQQLEAS ISAAERLRQSTRDEI AGKMATAITQLLPRAP 

VAISSRPLNLSKPIDFLSSTVYDKILDKEPYETAIAGFAWLEIATKSVMVYSQQNETQ 

QLNVLLSEVEKQSTVAQRIJn)LELSAKNTDDVKVLKQAI.DEIAPLRVK^ 

QKLESIESLLRATRTAGEISSEL.ERIGTQAVGTITVRDLGTLSDQCREAANFLRQASL 

PEGFSDIGTKLSELQAYIKYKKQFLEHFETTQPNVFQRFPLSQNITENVPARPAMDSV 

ARLTNHLHVRGSAPHFTTWIETLPTVDPEKPTHVPAHGGAPLHRQITYSNVLEALFSL 

CSTTLTPVPTAPGLEIATRARRGAEAATWMDRQWPDIAQTLQDVLDTYEHTTAHANRD 

AAFNTFLAMCVFTQIIRGASRAVTLPKLPSTAVDFPEEIVLTPRECTTLVTAMWPTL.A 

AAILRLKSYSEALGLMSRFLPLMFQALPHLTLEAQVKNGPHNTPPQLRCFAKTEAIPY 

FPAQWQSANLEQSLWGQTDFLQICDNNQRKARVAAVTWALTTIDGVVLDQLWSTFKPM 

TAASDDTYVDLVETLHLTTFGPRGPTPRRETTTEHPPYEYGQPTGYCISGQSTTPVQA 

SNTPVSAFEAVIX3AMVFHVPIRIFLAATPKRLGQARGGMGLLTPILECVPDVEPFKSL 

YNAPRKPVPIETLPASLHPHDERQVFLRQAQWLSYRFTPHEAARSSTPPLLWIDPEN 

LVTATYSSGGPANFESRPFYVMPGPYPPDWPKTLSVTSNTSVTHLSHDEICNLFTTLS 

REHGTVQGRDIFAAAPTNVTPEQTANPPAWETDNRLITQTETAKKPHIIPASPKARTD 

PPVETTTHHSQGQASQHAWSNVNQPGQITSHASRNTPSTAPQASSSPEKFNTQTVPRL 

ISQTSETAHINQPASGQVTEPKGIFGTYKPRVLTEPAKPANAGVASRQPEATTTVPKL 

PINPPTARVFIGTASKLSPAVEESHGATPDAHQSKIDREKYAESRPRRTPHLEEGPRE 

PHVNTPTSAHINVPSSQGQKTVHGRENPGLQTATPSAPQPTASNPRIQYTLPRTDGRL 

LHDESEVESTPTEEVKRSPKTQDVSHGPEPDDSRWTAPLGPTIEIHRLEHPQILKNIT 

SLTVPTPRVTPIPPTNIWIPLSHVNIQHEEITRAKNVLMRFIQNVRRKLQASSDALSE 

AIARIKFLYL" 



CDS 



complement (107637. .108146) 
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/note="capsid protein; ORF 65.; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 65" 
SEQ ID NO 145 / translation^ " MSSLRVKEPIVQGRLEHDYPNHPLVAEMNNLPQGDMSPAQYAIA 

KRNYLVFLTAKHHYDMYMQKKNGIIiRKDHLRGLiRGKKDASSSISGVLSGSGSAAPSVA 
PVASTLGSNSFTTISSGPHSLIGSMGPAPGGGGPGSVASSGIGSTSLSPSDATTLDTR 

RSSQNKKSK" 

SEQ ID NO 146 CDS complement { 1 08152 109498 ) 

/note= "unknown; ORF 66; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 66" 

SEQ ID NO 147 / translation= '»MASGRLPNLAEDEAACHGRGSYPAHRWLDGSRLGLDLAASIRSI 

GLCPECYVCFVTYGLGAWDGRPPKWACTLISAPSFQTALNEIATGWRPDNPPKNGDVR 
SRIiHDIGRSLLEAYAWVLRCICTGVGCPSDEGLSLTAVPRSAWSRYLWSFQRACCLV 
CKTLNCRQRFPLVTCIiPQHALDLPVLRKKWNGGGCVSMQLNVPS I SRRIiGANLNESVP 
GPSDAGLLASLREIiAPTVPCGNPFNAI.LRSLTFRALLSMSRVVLPIGESTETEISRDL 
GQKVLAYNVLFPCISLPWSQWARSVLEKTVPAPRVVVCLECGYCLNFGRGKFETVN 
FPPTNVFFSRDQKEKQIiS I CATTGRVYCSYCGGSHMRVISIiFEITCVGDPYLRCVLAN 
NAAHAIRDANSLVSVWPCLASPDCATGIiliKHLRVAELFYLTSSISSLSCGKCNRS " 

SEQ ID NO 148 CDS complement ( 109524 .. 110198 ) 

/note= "tegument protein; ORF 67; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 67" 

SEQ ID NO 149 / trans lation= " MSSGKRLVDELCDLWSYLGPSGISLDLERCQDGAPVYAKGGAV 

PVCTVRLQHGCVYHLEFVYKFWLHKLERIiAYPFAPCFVI INNGIiATTLKCFLCKPRDA 
DAQFGKNLPINSDVYLERNSSVSLGQDDFMKFKARLVFSGDIJ7VYSSMVICRTYFTEH 
RQVLQFLVVTPKSAKRLKTLLRTVFALTGHSDGIiGAIiRRTGSVARPSGSELKDIGRGE 

RAAMTN" 

SEQ ID NO 150 CDS 110609 111982 

/notes "glycoprotein; ORF 68; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 68" 

SEO ID NO 151 /translation- "MFVPWQLETLMRHWPSLRGLVEQSFLPGTPDGAFNSPVLIHTQD 

SLQPASSCRVCSLLFTIiVRTFPPPDSFFEDYGWLCLTCLYAPRSWTATLMVAADLLEL 
THVYFPQCVKDGPVYTAQSILGIDVQIiHFFATRCFRPIDREQILHTSHLNFLQTEFIR 
GMLEGTIPGSFCFKTSWPRTEKDDQQPTVACCSVGRGSHTNRDNRLPEDLEEAFNSTN 
AEEKPSLLGVFSATWAESQLLGSDTQQADTHLQPSAFPTPEDADQSQGPCLMHPTIiNL 
KTKNHTASICVLCECLAAHPDAGPVIiKDLRRDILENMENNVKLVNRISYILNDPDSLS 
HVRDEHLRGLIKRCSAQEIHKHFFCDPVCVLNTYSHCPAVLFKCPPPEKYKKLKARLA 
TGEFIiDCNRIFDCETLQTLAVLFKGSQLAKIGKTTSLEIIRELGFQIiRRHNIQITHPF 

QTSNLYI " 

SEQ ID NO 152 CDS 112 004 112 8 97 

/note= "unknown; ORF 69; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 69" 

SEQ ID NO 153 / translation- "MPKQPRSRLASRAPYAPSVRRPDGPQSTRPASRHGSCKSEIMQW 

KKLVSDTQFFSALTRRHELGVDFLREMGTPICTSKSVMLPLNLKTIAPGRCVSLSSFG 
HSSNMGFNCSSCTPTDRSAVSLDANALGEDSARKNSELCSVALTFYHHAEKWQHKGF 
YLSLLSHS^ffiVVRKSFTQPGLLYAHLVLKTFGHDPLPIFTVDADERLALWAVFHTRDL 
HLGETSLRLIMDNLPNYDIT\mCIKQTYIMKFTPSRPDNATVT\^VNSICEAVATLDC 

TDEFREEIQRGTAI INSQGLL " 

SEQ ID NO 154 CDS complement ( 1192 11 119735 ) 

/note="FLIP; ORF 71; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 71" 

SEO ID NO 155 /translation="MFPHKRLVDFGRHLEADDREAVI.WLFDRPASDDTPEGFANGLCP 
^ STGEPGIPLPVLLEAVFLVGRLDLVSTFFLLDVGFIIERLRSSPSYFSPYKHLMLSID 

RQLSERDVKNLVFLTGDQLGRRRNQSPTFFRWLSQMEKAALVSPSNYMVLSDLLQAVS 



56 



O '^rijJJ Jl MS O O m O :iU E Ui JL 



WO 00/28040 PCT/US99/26260 



RRDVAKWAANAPG " 

SEQ ID NO 156 CDS complement (119794 .. 120558 ) 

/note="cyclin D homolog; ORF 72; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 72" 

SEQ ID NO 157 /trans lat ion= "MASVGPVPTGTIDPVLYQDRALSNLLAHEASFWSTACYGTIQT 

EVTVGMRVILGTWMRS VARAHQADASVFPLAVS lUDRYLECRS I PRRRFQRLGAACLF 
LAGKIRDI^PFKAAFLCFCAAEDFSVADLLKQEKSILK^UjRWKLEAVliPTDAIGPTIiF 
KSGFTKEQLFALHSQWESVHKAIVNPATGGLPPSLVAAACALFSLGAAAPPPARIiAE 
AVGVSAATIiAAAAESVATTLREFDEDHILSNARGSS" 

SEQ ID NO 158 CDS complement (120866 .. 122212 ) 

/note=" latent nuclear antigen; ORF 73; similar to Kaposi's 
sarcoma-associated herpesvirus ORF 73" 

SEQ ID NO 159 / trans 1 at ion= "MWGSRQHRSGIVSGHGLRSSCRGHCGRRGGTREQAGRRGRGRGT 

AAPAAAPAPPAPTTSGPQVRAVAEQGHGSDTETATESRHGSSQGSPSGSGSESVIVLG 
SPTPSPSGSAPVL.ASGLSPQNTSGSSPASPASHSPPPSPPSHPGPHSPAPPSSHNPSP 
NQQPSSFLQPSHHDSPEPPEPPTSLPPPDSPGPPQSPTPTSSPPPQSPPDSPGPPQSP 
TPQQAPSPNTQQAVSHTDHPTGPSRPGPPFPGHTSHSYTVGGWGPPTRAGGVPCLRIiR 
CTSHNSHEDEAPERQQEQEGEERQQQPARPPRPPRPPRYP IPX P YPSSEEEVPRKYRP 
QRRFYRQVLGPRIDPPRPGPWCHGVIFCNSDPYSLYRLARCLQFPGIRASSVRVLPDA 
PGSPVIPAFCI TVFCQS RGTAKAVKKARRRWERHH PS APHFQ AS I VRMDRGLP I QH " 

SEQ ID NO 160 CDS 122 866 123 627 

/note="R15; similar to Kaposi's sarcoma-associated 
herpesvirus K14 and ox-2" 

SEQ ID NO 161 /translation="MSGGITLTIJJLATIiATWCAI.QTHYAAVPVHSTASLGCVLTTPH 

DVLIVTWQKQESPSPVNVATYSSEAGTWQPPFAGRVDIPEHKLTRTTLKFFNATIjED 
EGCYLCIFNAFGVGKLSGTACLTVYVPLSMSVTFYPPINPTQLVCRAEASPAPSVNWT 
GVPPELCSEPEVFPRPNGTTLWGRCNVTSVDPEDLENATCLVTHIGGIiAAARPIiDPV 
FSDPLEGTSHYWGWAAAAVLGIFLTGVFLYRSM " 

SEQ ID NO 162 CDS 123 924 124 952 

/note="G protein coupled receptor; ORF 74; similar to 
Kaposi's sarcoma-associated herpesvirus ORF 74" 

SEQ ID NO 163 / trans la tion= "r4DAI.NNNI^LLMDFIiSNYSNSYSSYDDNMSYTIJDTESTLCRLTV 

VFPPTVYAI I C FF I FC I TLFGNALVL YI FFKFKALANS VDVLMAGLCCNSLFLCAS FL 
FSWLLYVAPQMLTSATCKVEIFFFYLYTYFGVYIVVCISLIRCLLWFSRRPWVKHGA 
SGFLCVCVSLIVALAiSANASLYRTALRHPETSEWICYEDAGEDTVNWKLRIRTTSAI 
CGFLVPFGLMVLFYGLTWC^WKSTKIJ^KGAVRGVIVTVVVLFLIFCLPYHLCN 
LLRTGFLAETCYLRDVISVAMHICSLLQSMYSAFVPWYSGLGSLFRRRVRDTWSVFR 

CFSTSGSli" 

SEQ ID NO 164 CDS complement (125057 .. 128953 ) 

/note=" tegument protein; FGARAT; ORF 75; similar to 
Kaposi's sarcoma-associated herpesvirus ORF 75" 

SEQ ID NO 165 /trans lation= "MAQRTNPRWAAAALSPEEEAFIHDNSDAESVLALVPEQCFSEFL 

LWLVTRPSDNFDNDDDDPALGVIWHLLAPLVNYAPLETRSAHLQGHHTISLPYGPDLM 
RQPTTRSSEIVQCLRDSGLDRTLRLEVGRHIiSCQTRRFVADRVPPGTLAAIiTIiGTIiVE 
YDVRVQRQLPVTLQSTAWRPLPERDPICAAVMLPLQRNILPIiAVQASNGNSYTVSRYA 
VMARRSYSCVFQRLPCENVTHIADSFTHLHSAIQTGAGALQNILiFHATLLPGGEIRSA 
LCGFYATTPSVGAFSRARHRAINTTATLHCQOLARTGTPVLGGFLKTVHSATTSEANV 
ITTTSLLSCVPQAYTFLRRSLFSQPIICIiGSFEPVDGDGNQRSLYLGSAAGITRITQT 
LSLAYEILEGPLFTSINRAHEPASVIGHLGALVSRGGLRLFVSQIiPPTILSQLTATPD 
ISRETVNDILVNKFLNVSACVVFAVLPRDTEPEPGPIiDAIRRAARICGCPFAVVGETC 
EELGIQFVNDLELWNPGAWPIRQPTSAEVIATFGFDEQPVSSNWLVRPEEPEDGGEQA 
PSPTDWGLFRLASVVDQLLRCPTVGSKEFVTRHVDRCSNGLVAQQCEVGPL.GRPLSDY 
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HIVNHTSVFTDRMARVPIYRPQPITRQDATElUiVSPETV^QGRGRNRVA^GQCVAYGE 
Q AY KMG INAAVGAR Y AI CEAVTN I MLAHVRRLSD I TLT AS VG WNPEDDQAWLLQHTL F- 
ACKELCRDLSINFAITSAGSTPCLSEELISATQQHQTVAPVPFNAWITATAEVKSSR 
QRVTPDLKATGNLIVLVSFPVPHLTQGSTFEHLCLLPSPTLPDVQATHLANLFMLTEA 
LLSRGIiWSGHDVSDGGIWVTAIEMALAGNRGIiQIRIPSEETPLQWLVSETPGVIFEI 
QPQHVDEVRQACQNFDCRATVCGTVGQEGLSERIVISHNNDEVYSQTLTSVAANWTSF 
SDEQWYSWGPSFTPAQELYRKDYGCNQHNLGHLAEVCRWSELTLFATPSRPPAVAALV 
TPGAPLPRAL^IAAFT^^/GFDVAAVSTDDIiRGGNILRGFSGLTIGGTVGIEDSYVGARC 
AIMGLIiNDPGCYGGL^4AFFRRADTFSLCCGEFGFQLLGALGLX.RETPHDTPGPKTPDQ 
WDIHLEENASGNHECLWIJtTIJIIPQTTISIMFRVLRGLVI.PGWANGRYLGVRYPRDAME 
YHLNQQQRIAIiNYHTGNADPRMFAQHYPRNPSANSAVAAITSPDGRHLASLVDPAVTF 
HPWQWAYVPPELADMTVS PWALAFQSLFLWC IRNRQ " 



SEQ ID NO 166 
PGR primer 

CCTATGGGCTCCATGAGC 

SEQ ID NO 167 
PGR primer 

ATCGTCAATCAGGCTGCG 

SEQ ID NO 168 
PGR primer 

ATATTAAACACTCGCCGC 

SEQ ID NO 169 
PGR primer 

ATGAGGGGCCTTTTCGTGTGC 

SEQ ID NO 17 0 
PGR primer 

CTGAATCCCGCTGCCAAGGCC 

SEQ ID NO 171 
PGR primer 

ATGTTCCCTGTCTGGTTCGTC 

SEQ ID NO 172 
PGR primer 

TTACATCATAGCTATTGCGCG 
SEQ ID NO 173 

<213> Macaca mulatta rhadinovirus 17577 
nucleotides complement (233 98 . .23668) 
SEQ ID NO 174 

<213> Macaca mulatta rhadinovirus 17577 
nucleotides 25065.. 25368 
SEQ ID NO 175 

<213> Macaca mulatta rhadinovirus 17577 
nucleotides 25518.. 26525 
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SEQ ID NO 17 6 

<213> Macaca mulatta rhadinovirus 17577 
nucleotides 114979 115383 
SEQ ID NO 177 

<213> Macaca mulatta rhadinovirus 17577 
nucleotides 115385 116413 

SEQ ID NO 17 8 

<213> Macaca mulatta rhadinovirus 17577 
nucleotides 131731 131926 
SEQ ID NO 17 9 

<213> Macaca mulatta rhadinovirus 17577 
nucleotides 132333 133719. 
partial terminal repeat 
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